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Abstract Until recently, only about 10% of the total intracluster gas volume had been 
studied with high accuracy, leaving a vast region essentially unexplored. This is now 
changing and a wide area of hot gas physics and chemistry awaits discovery in galaxy 
cluster outskirts. Also, robust large-scale total mass profiles and maps are within reach. 
First observational and theoretical results in this emerging field have been achieved in 
recent years with sometimes surprising findings. Here, we summarize and illustrate the 
relevant underlying physical and chemical processes and review the recent progress 
in X-ray, Sunyaev-Zel'dovich, and weak gravitational lensing observations of cluster 
outskirts, including also brief discussions of technical challenges and possible future 
improvements. 
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1 Introduction 



A plethora of physical effects is believed to be acting in the outskirts of galaxy clusters, 
which ebbed away long ago in more central regions. This includes, e.g ., breakdown of 
equilibrium states like hydrostat ic equilibrium (e.g.jNagai et al.ll2007bh . thermal equi- 
librium and equi partition fe.g.. IFox fc Loea 1 19971 ). and ionization equilibrium (e.g., 
IWong et al 1l201lh . It is also in the outskirts, where structure formation effects should be 
widespread, resulting, e.g., in multitemperature structure and a clumpy gas distribution 
(Fig. [TJ. Moreover, the p rimary processes of intrac luster medium (ICM) enrichment 
with heavy elements fe.g. JSchindler fc DiaferiofeOQSl ) may be identified by determining 
the metal abundance up to the cluster outskirts. Last not least, future measurements 
of the evolution of the cluster mass function with ~100,000 galaxy clusters detected 
with the extended ROentgen Survey with an Imaging Telescop e Array (eROSITA, 
e.g. JPredehl et al.ll201ol ; IPillepich et al.ll20li : llVIerloni et al.ll2012r i will heavily rely on 
a detailed understanding of the cluster mass distribution. Therefore, tracing this distri- 
bution out to large radii will be important for using clusters as accurate cosmological 
tools. 

If cluster outskirts are so interesting, why haven't they been studied extensively 
with observations and simulations already long ago? In fact, we have only really seen 
the tip of the iceberg of the ICM up to now; i.e., the relatively dense central regions 
of galaxy clusters, the inner ~10% in terms of volume. The reason is, of course, that 
robust observations and realistic simulations are challenging in cluster outskirts. 

Why is that? The difficulties differ depending on the waveband used for cluster 
outskirt observations. For instance, the X-ray surface brightness drops below various 
fore- and background components at large radii, Sunyaev-Zel'dovich (SZ) effect mea- 
surements are also less sensitive in cluster outskirts where the gas pressure is low, and 
the weak gravitational lensing signal interpretation is increasingly plagued by projec- 
tion effects. Naively, simulations should be done most easily in outskirts because there 
the least resolution might be required. While this may be true for dark matter only 
simulations, it is not so simple if gas physics is included, e.g., the cooling of infalling 
subclumps. 

If it is so difficult, why has interest been rising in recent years? This is certainly 
mostly due to technical advances in observational and theoretical techniques but pos- 
sibly also to partially unexpected and sometimes controversial inital results. 

As is true for all articles in this review volume, we will put the emphasis on the 
ICM and total mass properties. The member galaxy and relativistic particle properties 
of cluster outskirts have been reviewed, e.g., in the Proceedings to the IAU c ollo- 
quium "Outskirts of Galaxy Clusters: Intense Life in the Suburbs" l|Diaferioll2004h and 
iBriiggen et alj (|201lh . respecti vely. Other use f ul reviews mostly about the ICM prop- 



erties of clu s ters i n clude, e.g., Sarazin dl986h; Borgani fc Guzzol (12001 Rosati et al . 
( 2002 ); IVoitl(l2005h:lArnaud1(|2005l ): lNormanl(|2005l ): lBorganil l|2008h : lBorgani fc Kravtswl 
( 201 lb ; ! Allen et alJ 



This article is organized in 7 Sections. Section [5] contains our definition of clus- 
ter outskirts, Section provides some basics on cluster mass determination, Section!?] 
includes a summary of the status of ICM profiles as well as descriptions and illus- 
trations of physical effects relevant for cluster outskirts, Section [5] outlines chemistry 
aspects, Sectionjfjsummarizes technical considerations for X-ray, SZ, and weak lensing 
measurements, and Section [7] gives a brief outlook. 



Fig. 1 Simulated galaxy cluster. The white circles indicate rspp, r200i r vin an d 3 r2oo moving 
outwards, respectively (adapted from Roncarclli ct al. 20(3). Left: X-ray surface brightness in 
the soft (0.5—2) keV band. The color scale spans 16 orders of magnitude and has been chosen 
to highlight cluster outskirts. Right: Temperature map on a linear scale from keV (blue) to 
11 keV (red). 

2 Where are the "cluster outskirts"? 

Let us define, which radial range we consider as "cluster outskirts." Readers not inter- 
ested in more details on the radial ranges can skip this section and just take note of 
our subjective choice: 

r 500 < cluster outskirts < 3r2oo , (1) 
where t^qq (defined below) used to be the observational limit for X-ray temperature 
measurements and the range up to 3r2oo captures most of the interesting physics 
and chemistry before clearly entering the regime of the warm-hot intergalactic medium 
(WHIM, Fig.[T]). This range also in cludes (i) the turn aro und radius, rturn = 2r v j r , from 
the spherical collapse model (e.g., iLiddle fc Lvthl feoool ) . (ii) part of t he infall regio n 
where caustics in galaxy redshift space are observed, several Mpc (e.g.. lDiaferio| [T999), 
(iii) much of the ra dial range where accretion shocks might be expected, (l-3)r v j r (e.g., 
Molnaretal ] l2009h . and (iv) the region where the two-halo ter m starts dominating ove r 
the one-halo term in the matter power spectrum, few Mpc (e.g.. lCoorav fc Shcth 2002). 

A theoretical recipe that can be used to define a cluster "bord er," "boundary," or at 
least a "characteristic" radius is the spherical collapse model (e.g.. lAmendola fc Tsuiikawal 
l2010l ). Based on this very idealistic model, a virial radius, r v j r , separating the virialized 
cluster region from the outer "infall" region, can be obtained by requiring the mean 
total mass density of a cluster, (ptot), to fulfill 

/ w , \ 3Aftot(< ^vir) Avir/ \ / \ /o\ 

<Ptot)(< r vir ) = — 3 = A c {z)p c (z) , (2) 

where p c (z) is the critical density of the Universe at redshift The virial overdensity, 
A^ lr (z), is a function of cosmology and redshift, in general (e.g., iKitavama fc Sutq 

1 Some authors use the mean matter density of the Universe, Pm(z) = fl m (z)p c (.z), instead 
of the critical density for their overdensity definition. 
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1 19961 : IWeinberg fc Kamionkowskill2003h . E.g., for a flat Universe with mean normalized 
matter density J? m = 1, Zic ir = (6tt) 2 /2 w 178 (at any z). Since 178 w 200, r Ac=200 = 
r 200 is sometimes used as a rather crude approximation to the virial radius (V200 ~ 
r v i r ). On the other hand, using a cosmology other than Einstein-de Sitter, A™ varies. 
E.g., for O m = 0.3 and Q A = 0.7, on e finds A™(z = 0) sa 1 01, Zic ir (.z = 0.5) » 138, 
and zi™(2 = 1) » 157 (using eq. 6 of lBrvan fe Normanlil99sh . 



Another possibility to define a virial radius (at least in simulations) is to use the 
region within which the condition of virial equilibrium (2£/j c in ~ —E po t) is satisfied. 

The virial mass is also the one that should be mea sured when comparing observa- 
tional cluster mass functions to the Press-Schechter (1974) mass function. However, 
when it became clear (e.g.. Iciovernato et al]|T999h that semi-analytic recipes for the 
mass function, like Press-Schechter, are not accurate enough for modern measurements, 
authors shifted to using p arametrized fits to mass functions as obt ained from numerical 
TV-body simulations (e.g.. I Jenkins et aDl200ll ; iTinker et al.l l2008.) to compare observa- 
tions to predictions (at the expense of loosing any analytic understanding of the mass 
function, of course). This removed the need to use virial masses for the comparison and 
it has become common practice to simply use a fixed value for A c in both observations 
and simulation^; i.e., 

3M t ot(< r A J 

4^3 c • ( 3 ) 

Typical overdensities used in the literature include A c = 100, 180, 200, 500, 666, 
1000, and 2500. Assuming an NFW l|Navarro et alJll997h profile with concentration, 



c = r 2 oo/r s = 4, one finds r vir (z = 0) « r wo « 1.36r 20 o, ISO ~ 1.05r 2 oo, ^500 « 
0.65r 2 oo, r 666 w 0.57r 2 oo, nooo ~ 0.46r 2 oo, and r 250 o « 0.28r 20 o- While it is obvious 
that using a fixed value for A c makes things simpler, this choice also requires picking 
a "magic" number: which overdensity to pick?, which is the best radius, compromising 
between simulations and obervations? It appears that, currently, a good choice would 
be in the range 500 < A c < 1000, where the lower limit comes from observations 
and the upper limit from simulations. An interesting number is then also the ratio of 
volumes within overdensities 500 and 100: 



'"500 



r vil (z = 0) 



0.1, (4) 



2 Note that this definition also leads to a funny effect when comparing cluster mass functions 
at different redshifts: assume an unrealistic Universe without evolution of clusters and their 
number density. For instance, at z = and at z = 1 we would then have the same number of 
clusters (with identical distributions of physical mass profiles) per comoving volume. Now, the 
measured M200 of two clusters with identical mass profiles at both redshifts differ, the M200 of 
the higher redshift cluster being smaller because p c (z = 1) = E 2 (z = l)p c (z = 0) ~ 8p c (z = 0) 
for a "concordance cosmology" and, therefore, the mass profile gets integrated only to a much 
smaller physical radius for the higher redshift cluster. Plotting the mass functions at z = 
and z = 1 would then result in a lower number density at higher redshift. This effect is further 
enhanced when the mean density is used for the overdensity definition instead of the critical 
density or if the virial overdensity is used. So, clearly, the definition of outer radius has a 
strong effect on the perceived evolution of the mass function. Since the choice of A(z)p(z) is 
arbitrary (it just has to be consistent between observations and predictions) and if one wanted 
to appreciate the pure number density evolution of the mass function from a plot more directly 
one could, e.g., use masses defined with a fixed overdensity with respect to the critical density 
at z = for all clusters; i.e., make both A and p redshift independent. Note that this would 
still ensure that, at a given redshift, low mass clusters would be treated in a way that allows 
comparison to high mass clusters, which would be less obvious if a metric radius (e.g., the 
Abell radius) was used. 
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which implies that measurements limited to rsoo explore only about 10% of the total 
cluster volume! 

Due to their high particle backgrounds, Chandra and XMM-Newton are basically 
limited to <r5oo for robust gas temperature measurements. As we will see in Sec- 
tion [JT21 Suzaku now routinely reaches ~r2oo- Swift may soon follow for a few bright 
clusters. ROentgenSATellit (ROSAT) and SZ (stacking) observations constrain well gas 
density and pressure out to ~r2oo, respectively (Sections 14.1.11 and 14. 1.3) . Currently, 
we cannot observationally reach the outer border of our definition of cluster outskirts, 
3f"200! leaving ample discovery space for the future. 



3 Mass 

In the contributions by Ettori et al. and Hoekstra et al. of this volume, detailed reviews 
on cluster mass reconstruction are provided. Here, we summarize some basics that are 
important for our discussion of cluster outskirts. 



3.1 Total mass inferred from ICM properties 

The total mass of galaxy clusters can be determined by measuring ICM properties, like 
density, temperature, and pressure. We will see in SectionUthat a clear understanding 
of the gas physics is required for accurate mass determinations. In Section|5] examples 
are given that illustrate technical challenges that need to be overcome to understand 
gas physics in cluster outskirts. 

Under the assumptioijf] that the ICM is in hydrostatic equilibrium with the gravi- 
tational potential, the integrated total mass profile, Mt t(< r), is given by 

1 dP _ GM t ot(<r) 
Pgas dr r 2 

where P is the gas pressure, p gas its density, and G the gravitational constant. Applying 
the ideal gas equation, P = ^-p g as2gas, results in 

Mtot (< r) = -M^ ( + , (6) 

G[im p \ dlnr dlnr J 

where \x as 0.6 (Section I4.7f) is the mean particle weight in units of the proton mass, 
ra p , and k-Q is Boltzmann's constant. So, the total mass within a given radius depends 
on the gas temperature at this radius, as well as the temperature gradient, and the gas 
density gradient. Note there is no dependence on the absolute value of the gas density, 
only on its gradient. 



3 Other, mostly minor, assumptions that we will not discuss include: gravitation is the only 
external field (e.g., no magnetic field), clusters are spherically symmetric (e.g., do not rotate), 
no (pressure supplied by) relativistic particles, fi is independent of r (e.g., negligible helium 
sedimentation, constant metallicity), Newtonian description of gravity is adequate (e.g., no 
relativistic corrections), the effect of a cosmological constant (dark energy) is negligible. 
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3.1.1 X-ray measurements 



The hot ICM is collisionally highly ionized and mostly optically thin. Using X-rays, 
the gas density and temperature profiles can be determined. At temperatures k^T e > 
2 keV0 and typical ICM metallicities (0.1-1 solar) thermal bremsstrahlung (free- free) 
emission is the dominant emission process. The emissivity; i.e., the energy emitted per 
time and volume, at frequency v is given in this case by 

£ ocnln^e"^ , (7) 

where n c (oc p ga s) and T e denote electron number density and temperature, respec- 
tively. So, fitting a model to a measured X-ray spectrum yields density and temperature 
of the hot electrons. At lower temperatures (k^T e < 2 keV), line emission becomes 
important or even dominant and serves as an additional temperature discriminator 
(Fig.0). Note also that the abundances of heavy elements and the cluster redshift can 
be constrained by modelling the line emission. 

So, to determine the total mass out to large cluster radii, one needs to measure 
gas density and temperature profiles in low surface brightness outer regions. There, 
not only are the measurements themselves quite challenging but also several physical 
effects may become important that can usually be ignored in inner parts. Both issues 
will be discussed in some detail in this review (Sections [6] and [4] respectively). 

As we will see, gas temperatures typically decline with radius in the outer regions 
of clusters. Equation (0 shows that both the absolute value as well as the gradient 
of the temperature at a given radius contribute, and they work in opposite directions: 
for a declining temperature, the former term decreases the total mass while the latter 
increases it. It will be of interest in the course of this article, which term usually dom- 
inates. To illustrate this, we show in Fig. [3] how M200 changes depending on the slope 
of the temperature profile, for a simple model cluster with a density profile following a 
single beta model with /? = 2/3 and a core radius of 150 kpc, and a temperature profile 
T(r) = To(r/r cu t)~ 7 with Tq — 6 keV and r cu t = 400 kpc. One notes immediately 
two things: first, the absolute value of the temperature at a given radius is much more 
important than its gradient because the steeper the temperature profile the lower the 
total mass, and, second, the temperature profile can have a significant impact on the 
total mass determination even if its gradient is much smaller than the gas density gra- 
dient. The density gradient effect is clear from ([3}: the steeper the profile, the larger the 
total mass. Recall in this context that convection will set in if the gradient of specific 
entropy becomes negative, so hydrostatic equilibrium is likely not a good assumption 
if -dlnTga S /dr > -2/3dlnp gas /dr (Section 



3.1.2 SZ measurements 



The hot ICM electrons emitting in X-rays also change the intensity of the cosmic 
microwave background (CMB) radiation via inverse Compton scattering. The charac- 
teristic features of this spectral distortion of the CMB were predicted by R. Sunyaev 
and Y. Zel' dovich shortly after the dis covery of X-rays from clusters, and is named 



after them ( Sunyaev fc Zeldovichl ["19721 ). The distinguishing feature of this effect is a 



decrement of CMB intensity below ~220 GHz, where clusters appear as a dark spot or 



4 T e is often used synonymously to k^T c =>■ 1 keV ~ 1.16 X 10 7 K; X-ray photon energies 
are also typically expressed in keV; 1 keV « 2.42 xlO 17 Hz m 12 A A. 
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Fig. 2 Hot gas model X-ray spectra including free-free, free-bound, and bound-bound emission 
as a function of fee 7c (black: 1 keV, red: 3 keV, green: 9 keV). The emission measure was kept 
fixed for all spectra; i.e., all spectra assume the same electron density distribution. Notice 
how the exponential bremsstrahlung cutoff shifts to higher energies for higher temperatures, 
as expected from Q. Note also the Fe L and K shell emission line complexes at ~1 and ~6 
keV, respectively. The line strength depends on metallicity (here assumed to be 0.4 solar) and 
temperature. For hot clusters (fceT e > 2 keV), the emission in a soft energy band, e.g. (0.5-2.0) 
keV, is almost independent of T e . 



a "hole" in the microwave sky, and an increment above 220 GHz. This effect is more 
precisely called the thermal Sunyaev-Zel'dovich (tSZ) effect, to distinguish it from 
the scattering signal caused by the bulk motion of the intracluster gas (the kinematic 
Sunyaev-Zel'dovich, kSZ, effect). The latter has more than an order of magnitude lower 
amplitude than the thermal effect, and for the rest of the discussion we will specifically 
focus on the thermal SZ effect only. 

Excellent r eviews for the SZ effec t and i t s cosm ological applications are given, e.g., 
by iBirkinshawl |l999) and ICarlstrom et al. (2002). Recent advances in detector tech- 
nology have made the first blind detection of SZ clusters possible ( Staniszewski et al] 
2009), and three large-scale experiments are currently in operation which are providing 
many more SZ selected clusters out to z = 1 and beyond: the South Pole Telescope 
( Vanderlinde et al]|2O10l), the Atacama Cosmology Telesc ope ( Marriage et al.l [201ll ') 
and the Planck satellite ( Planck Collaboration et al.ll2011bh . 



The signal of the SZ effect is directly proportional to the integrated pressure of 
the intracluster gas along the line-of-sight, which is measured as the Comptomzation 
parameter, y. The change in the background CMB intensity is thus .47"cmb/^CMB = 



s 




f(y)y, where /(f) is the spectral shape function, and 



y= I ^l a^nedl . (8) 



Here m c is the electron mass and o"x is the Thomson scattering cross-section. The 
integration is along the line-of-sight path length dZ. For typical ICM temperatures and 
densities, the relative change in CMB intensity is small: 4/cmb/^CMB ~ 10 -4 — 10~ 5 . 
The major advantage of the SZ effect comes from its redshift independence, since the 
signal is the result of scattering of the background CMB photons, and both scattered 
and un-scattered photons redshift together. This puts the SZ effect in contrast to all 
other astrophysical signals, for example the (1 + z) 4 dimming of the X-ray surface 
brightness (eg. I19p . In practice, however, this redshift independence is currently not 
fully exploitable due to finite beam sizes. Another major advantage is the linear depen- 
dence of the SZ signal on electron density, as opposed to the dependence of X-ray 
brightness, which potentially makes it more suitable to study the low density outskirt 
environments. 

SZ measurements can be used in at least two different ways to determine the total 
mass from ICM properties. Both methods, however, do require additional information 
from X-rays since constraining the SZ observable, pressure integrated along the sight 
line, alone is insufficient to apply the hydrostatic equation (|5j). 

The first method aims at directly measuring the cluster pressure profile. By adding 
the gas density profile from X-ray observations, the hydrostatic equation can be applied. 
In the second method, density and temperature profiles are determined simultaneously 
from joint X-ray / SZ modelling. More d etails of both methods are described in Sec- 
tion [6]2] See also iLimousin et al.l ( 20131 . in this volume) for a review on combining 
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X-ray, SZ, and gravitational lensing measurements to constrain the three-dimensional 
shape of clusters. 



3.2 Total mass inferred from weak gravitational lensing 

To first approximation, gas physics can be ignored for weak lensing mass reconstruc- 
tions. This simplifies the situation considerably if one is only interested in cluster mass. 

Weak gravitational lensing offers an alternative route to measuring cluster mass 
profiles, independent of the physical state and nature (dark or luminous, baryonic or 
non-baryonic) of the matter. One exploits the spatial correlation of weak shape distor- 
tions of background galaxies induced by a cluster's gravitational potential. The weak 
lensing observable, the so-called reduced shear g(8) as a function of the lens-plane po- 
sition fljs_£Oimected_to th^projected surface mass densitjjf] k via a non-local relation 
fe.g.. iKaiser fc Squired fl993;). Cluster masses c an be inferred from the reduced shear 
profile p(6) by either fitt ing with a profile fe.g.. lNavarro et al.lfl996l : lBartelmannlll996l : 
IWright fc Brainerdllioool , NFW) function or by directly inverting the sh ear-mass prob- 
lem. An early direct method, the aperture densitometry or (f-statistics (IFahlman et ah! 
Il994f ). spawned the development of the aperture mass estimator |Schneider[ ]l9*9*6[K 
which is mainly used to detect mass overdensities via weak lensing. ISeitz fc Schneider! 
(2001) developed a mass reconstruction algo rithm computing a two-dime nsional con- 
vergence map from an input shear catalogue. lUmetsu fc Broadhurstl ( 20081 ) introduced 



a maximum entropy method tackling the same problem from a Bayesian viewpoint 
and present mass profiles for a well-studied lensing cluster, Abell 1689, using different 
lensing methods. 

While weak lensing shear profiles can be measured as far out as the field-of-view of 
the camera permits, the cluster signal slowly sinks into the cosmic-shear background 
caused by lensing due to uncorrelated large-scale structure. We address this topic in 
greater detail in Section [6.3.11 As th e mass enclosed w ithin a sphere described by an 



at large radii. lOguri fc Hamanal 1 2011T ) provide the corresponding lensing profile which 
they find to give a better representation of the cluster shear obtained by ray tracing 
through an A"-body numerical simulation. 

A further practical limitation to the precision of weak lensing mass profiles arises 
from the considerable intrinsic and observational scatter in galaxy ellipticities, which 
dominates over the shear signal outside a certain radius depending on both the cluster 
mass and the lensing efficiency (e.g., Hoekstra et al., this volume). 



NFW profile diverges logarithmically. [ Baltz et al] (|2009h introduced a smoothed cut-off 



3.3 Total mass inferred from galaxy velocities 

While this review focusses on cluster outskirts mass estimates through ICM and weak 
lensing measurements, a tremendous amount of work has been done using galaxy ve- 
l ocites. Indeed , the first robust hints on the existence of dark matter are due to them 
Zwicky|[l933T ). As a simple example, assuming virial equilibrium, the total cluster 
mass can be related to the radial galaxy velocity dispersion through Mtot ~ ra 1 jG. 



5 Defined as k = E / ' J7 CI .; t in terms of the critical surface mass density. 
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Masses have been estimated for large samples of galaxy clusters thro ugh galaxy ve- 
locites, resulting in cosmological constraints, scaling relations et c. fe.g.. iBiviano et all 
ll993l ; lGirardi et al.lll998l ; lBorgani et"al]|l999l ; [Zhang et al.ll20l"ll ). 

Of particular interest for cluster outskirts is the so-called caustics method (e.g., 
Diafcrio & Gellerlll997l ; |Piaferid[l999l ) , which has been used to infer cluster masses out 
to lar ge radii without equilibrium assumptions fe.g jRines et aLll2003l . l2013l ; lRmes fc Diaferiol 
2006). The method is based on the measurement of sharp, trumpet-like features in red- 
sh ift space as a function of cluster centric distance in cluster infall regions (e.g., Fig. 5 
m iKaiserll 19871 ). The amplitude of these "caustics" depends on the escape velocity and, 
therefore, the mass. 



4 Gas physics 

In this Section, we discuss several physical effects that may influence the uncertainty of 
the X-ray mass determination in cluster outskirts. While observations and simulations 
are always discussed in the following when relevant, we summarize the status of ICM 
density, temperature, pressure, and entropy profiles as well as the gas mass fraction in 
the first Section (|4~Tj) . 



4.1 Overview of ICM properties in cluster outskirts 
4-1.1 Surface brightness and gas density profiles 



As has been summarized recently bv lEttori fc Molendil ( 201lh , the X-ray surface bright- 
ness is a quantity much easier to characterize than the temperature and it is rich in 
physical information being proportional to the emission measure, i.e. to the square of 
the gas density, of the emitting source. Thanks to its large field-of-view and low instru- 
mental background, ROSAT PSPC is still the main instrument for providing robust 
constraints on the X-ray surface b rightness profile of ga l axy clusters ove r a significant 
fract ion of the virial radius (e.g., IVikhlinin et al.lll999l ; iNeumaiml 120051 ; lEckert et alj 

l2orA 



Vikhlinin et al.l (Il999l ) found that a /3-model with /3 = 0.65-0.85 (i.e., a power-law 



slope in the range -2.9 to -4.1; from S x (r) « r 2 ( - 5 - 3 ^) = r 1_6/3 ) described well the 
surface brightness profiles, Sx{r), in the range (0.3-1) riso of 39 massive lo cal galaxy 
cluste rs observed with ROSAT PSPC in the soft X-ray band, (0.5-2) keV. iNeumannl 
( 20051 ) found that the stacked profiles of a few massive nearby systems located in regions 
of low (<6 x 10 20 cm~ 3 ) Galactic absorption observed by ROSAT PSPC provide values 
of /3 around 0.8 at r2oo> with a power-law slope that increases from —3, when the fit is 
do ne over the radial range ( 0.1-1) r2oo, to —5.7^^2 over (0.7-1.2) r2oo- 

lEttori fc Balestral ([2009) studied the X-ray surface brightness profiles at r > rsoo 
of 11 objects extracted from a sample of hot (Tgas > 3 keV), high-redshift (0.3 < z < 
1.3) galaxy clusters observed with Chandra. They performed a linear least-squares fit 
between the logarithmic values of the radial bins and the background-subtracted X- 
ray surface brightness (Fig. [4j . Overall, the error- weighted mean slope is —2.91 (with 
a standard deviation in the distribution of 0.46) at r > 0.2 r2oo and —3.59 (0.75) at 
r > 0.4 r2oo- For the only 3 objects for which a fit between 0.5 r2oo and Rs2N, the 
maximum radius out to which the cluster surface brightness could be measured with 
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a signal-to-noise ratio of at least 2, was possible, they measured a further steepening 
of the profiles, with a mean slope of —4.43 and a standard deviation of 0.83. They 
also fitted linearly the derivative of the logarithmic Sx(r) over the radial range 0.1 
r 200~Rs2N i excluding in this way the influence of the core emission. The average (and 
standard deviation a) values of the extrapolated slopes are then —3.15 (0.46), —3.86 
(0.70), and —4.31 (0.87) at 0.4r2oo, 0.7r2oo and t"2oo, res P ec ti ver y- These values are 
comparable to what has been obtained in previous analyses of local systems through 
ROSAT PSPC exposures and are supported from the studies of the plasma's properties 
in the outer regions of hydrodynamically s i mulated X-ray em itting galaxy clusters (e.g., 
iRoncarelli et al.ll2006r . fNaeai fc Laull201ll : IVazza et alJboilh . 
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Fig. 4 From left to right: (i) Chandra surface brightness profile of MS0015.9 with the fitted 
background (horizontal dotted line) and the radius r2oo [vertical dashed line); (ii) signal-to- 
noise profile, evaluated as S2N = (5x — B)/e as function of r/r200, where the error e is the 
sum in quadrature of the Poissonian error in the radial counts and the uncertainties in the 
fitted background, B, defined by considering a region far from the X-ray center that covered a 
significant portion of the exposed CCD with negligible cluster emission; (iii) the best-fit values 
of the slope of the surface brightness profiles as a function of r/r2oo- The thick horizontal 
solid line indicates the slope evaluated between 0.4 r200 and Rs2N with a minimum of 3 
radial bins. The dashed line indicates the best-fit of dln(S-a) / dln(r /r20o) with the functional 
form sq + si ln(r/r2oo) over the radial range 0.1 r2oo— Rs2N (reprinted with permission from 
lEttori fc B alcstra 20Q2). 



Indeed, modelling cluster outskirts with hydrodynamical simulations is generally 
considered an easier task with respect to cluster cores. The reason is that any form 
of feedback effect is usually connected with active galactic nuclei (AGN) and star- 
formation and, therefore, with gas density, thus leaving the external regions more 
dominated by gravitational collapse. In fact, it has been shown with hydrodynamical 
simulations that different physica l presc riptions have a small impact on clusters' profiles 
in the outskirts. IRoncarelli et all ( 20061 . see also Fig.[TJ studied the behavior of the pro- 
files of density, temperature and surface brightness in a sample of 9 galaxy groups and 
clusters simulated with the Lagrangian code GADGET-2, with non-radiative physics 
and with several treatments of cooling, star-formation, feedback, viscosity, and thermal 
conduction. They found a clear steepening of all the profiles around the virial radius 
with very small differences due to the physical models. In detail, the slope of the soft, 
(0.5-2) keV, X-ray surface brightness changes from about —3.2 close to the center to 
about —5.5 in the outskirts for high-mass objects, with a slope of —4, —4.5, —5.2 when 
estimated in the radial range (0.3-1.2) r2oo 5 (0.7-1.2) r2oo, (1.2-2.7) r2oo, respectively, 
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in excellent agreement with the observational constraints. Moreover, on the scale of 
galaxy groups (M200 ~ 10 14 iW©), the steepening is much more prominent, with slopes 
that vary from about —2.5 to —7.5. A similar behavior is measured in the profiles of 
the gas density and temperature. These evidences can be explained as due to the fact 
that, in the present scenario for structure formation, galaxy groups are dynamically 
older and more relaxed than clusters, as also suggested from the observ e d and simu- 
lated behavior of the co ncentration-mass relation (e.g.. iNeto et al.ll2007l ; iBuote et al.l 
120071 : lEttori et alj|2010h . 

However, simulations highlight that the accretion pattern in the outskirts is indeed 
complicated. Non-adiabatic effects like turbulence and shocks are frequent in unrelaxed 
systems and can influence the pressure profiles. Even if these processes are not con- 
nected to any feedback of the star-forming phase, they often require special treatment 
in the simulations because they act at smal l scale s that are difficult to reach in cos- 
mological simulations. Recently. IVazza et al.l ( 20091 ) ran simulations with the Eulerian 
adaptive mesh refinement (AMR) code ENZO including the implementation of a new 
sub-grid refining scheme, allowing them to study the velocity pattern of the ICM in 
simulated clusters. Their results showed that the kinetic and turbulent energy associ- 
ated with the ICM account for 5-25 per cent of the total thermal energy inside r v j r . 
iBurns et all §010) found a remarkable agreement between the temperature, density, 
and entropy profiles of 24 mostly substructure-free massive clusters simulated with the 
ENZO code and the published Suzaku results, implying that (i) the simplest adiabatic 
gas physics is adequate to model the cluster outskirts without requiring other mech- 
anisms (e.g., non-gravitational heating, cooling, magnetic fields, or cosmic rays), and 
(ii) the outer regions of the ICM are not in hydrostatic equilibrium. 

Inhomogeneities in the X-ray e mission due to random density fluctuations are 
expected. Using simulated clusters, iMathiesen et all (| 19991 ) measured a mean mass- 
weighted clumping factor C = (pgas)/(/ 3 gas) 2 between 1.3 and 1.4 within a density 
contrast of 500. If clumping is ignored, the gas density, and therefore, the gas mass, 
gets overestimated by \[C. iNagai fc Laul (|201ll ) analysed 16 simulated clusters and 
computed their dumpiness profiles with and without radiative cooling. They found 
that, at r200) C goes from 1.3 in the former to 2 in the latter ones. The non-radiative 
clumping factor is higher because radiative cooling removes gas from the hot T > 10 6 
K X-ray emitting phase in the simulations. This shows that adding gas physics has 
quite a significant effect in cluster outskirts. The authors suggest that, since hydrody- 
namical simulations suffer of a form of overcooling problem, the true result is likely 
to be between these values. Overall, they concluded that gas clumping leads to an 
overestimate of the observed gas density and causes flattening o f the entropy profile , 
as suggested qualitativ ely from recent Suzaku observations (e.g.. Illoshino et al.l I2OI0I : 
ISimionescu et al1l201lh . 

Suzaku has indeed improved the observational characterization of the faint emission 
from cluster outsk i rts, p roviding the first spectroscopic constraints on it (Section ^. 1.21) . 
ISimionescu et all l|201ll ) have resolved the baryonic and metal content of the ICM 
in the Perseus cluster out to r2oo ~ l.8h^ Q Mpc, estimating a clear excess of the 
gas fraction with respect to the expected cosmic value alon g the northwest ax is, and 
confirming some preliminary evidence from ROSAT PSPC ( Ettori et al.l 1 199^ . where 
the gas density is resolved at r2oo but an isothermal gas is assumed due to limited 
spectral capability of the PSPC; see Fig.0. They suggest that dumpiness of the ICM 
on the order of C w 4-16 over the radial range (0.7-1) r2oo is required to properly 
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reconcile the expected and measured gas mass fraction; i.e., significantly higher C than 
indicated by the simulations described above. 



West 




r (Mpc) 

Fig. 5 Left (from Simioncscu ct al. 2011, reprinted with permission from AAAS): Deprojected 
electron density, entropy (K), and pressure profiles towards the northwest (red data points) 
and east (blue data points) of the Perseus cluster, measured with Suzaku. The red line shows 
the northwestern profiles corrected f or clumping. The expected entropy profile from simulations 
of gravitational collapse (Voit 2005) is a power-law with index 1.1 over-plotted in the entropy 
panel (black dotted line). The average profile of a sample of clusters previously studied with 
XMM-Newton within ~0.5 T200 dArnaud et al.l2010l ) is shown in the pressure pane l (solid black 
curve ; its extrapolation to r200 is shown with a dotted black line). Right ("from lEttori et aD 
1998, reprinted with permission): Gas mass fraction profiles obtained from the deprojection 
of the ROSAT PSPC surface brightness profiles in different azimuthal sectors by assuming a 
Navarro- Frenk- White (NFW) dark matter density profile (triangles joined by a dotted line). 
The gas fraction measured from the azimuthally averaged profile, both using an isothermal 
profile (solid line) and an NFW profile (dashed line), is shown. r200 = 2.7 Mpc in this analysis. 
An Einstein-de Sitter Universe with Hq = 50 km s _1 Mpc -1 is here adopted. 

A more observable quantity is the estimate of the azimuthal scatter along the 
cluster radius S(r) — ^Jl/N ^y^i''^ , where yt(r) is the radial profile of a 
given quantity, taken inside a given i— sector, and Y(r) is the average profile taken 
from all the cluster volume. Contributions to this scatter are expected from both an 
intrinsic deviation with respect to the spherical symmetry and, most importantly, to 
the presence of filaments of X-ray emission associated to any preferential direction in 
the accretion pattern of the ICM. 

Two sets of high-resolution cos mological re- s imula tions obtained with the codes 
ENZO and GADGET2 are used in IVazza et all (|201lh to show that, in general, the 
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azimuthal scatter in the radial profiles of X-ray luminous galaxy clusters is about 10 
per cent for gas density, temperature, and entropy inside r^ocb and 25 per cent for X- 
ray luminosity for the same volume. These values generally double approaching 2 r2oo 
from the cluster center, and are found to be higher (by ~20-40 per cent) in the case of 
perturbed systems. These results suggest the possibility to interpret the large azimuthal 
scatter of observables as estimated from, e.g., Suzaku with the present simulated data. 




Fig. 6 Comparison between the mean ROSAT density profile for a sample of 31 nearby X- 
ray bright galaxy clusters and numerical simulations both including cooling and star formation 
(left) and with non-radiative physics (center). Right: Stacked emission measure profile (in units 
of cm -6 Mpc) for the entire sample (black), the cool-core (CC, red) and non-cool-core (NCC, 
blue) systems. The bottom pa nel shows the ratio between the CC and NCC populations. 
Reprinted with permission from lEckert et al 



In lEckert et al. I HoH), a stacking analysis of the gas density profiles in a local 



(z — 0.04-0.2) sample of 31 galaxy clusters observed with ROSAT PSPC is presented 
(Fig. [6j). They observe a steepening of the density profiles beyond ~r5oo- They also 
report the high-confidence detection of a systematic difference between cool-core and 
non-cool-core clusters beyond ~0.3 r200j which is explained by a different distribution 
of the gas in the two classes. Beyond rsoo, galaxy clusters deviate significantly from 
spherical symmetry, with only little differences between relaxed and disturbed systems. 
The observed and predicted (from numerical simulations) scatter profiles are in good 
agreement when the 1% densest clumps are filtered out in the simulations. While all the 
different sets of simulations used by them, especially beyond ~0.7 r2oo, show a relatively 
good agreement, they seem all to predict steeper profiles than the observed one from the 
PSPC, in particular in the radial range between 0.4 and 0.65 r2oo- Appr oaching r2oo, 
the slope increases up to /3 « 1 both in simulated and observed profiles. lEckert et al.l 
also conclude that a slightly better agreement in terms of shape of the gas density profile 
is foun d when a treatment of the observational effects of gas clumping is adopted (as, 
e.g., in lNagai fc Laull201l[ ). 

At very large radii up to ryir, Suzaku data seem to indicate a flattening of the 
gas density profile fe.g.. iKawaharada et alJuOlOl l. Recall (eq. [6j) that the gas density 
gradient enters linearly in the total mass determination. A flatter profile, therefore, 
results in a lower total mass estimate (and a larger gas mass estimate). 
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4- 1.2 Temperature profiles 



We have seen (e.g., Fig. [3| that the inferred total cluster mass strongly depends on 
the measured temperature profile. This is mostly because T gas (r) enters linearly in the 
hydrostatic equation (J5J), therefore, a 20% uncertainty in T sas (r) results in a mass un- 
certainty contribution of 20% if a fixed radius is chosen (~30% if the mass is determined 
within an overdensity radius). 

Despite the poor, energy-dependent point-spread-function (PSF), the Advanced 
Satellite for Cosmology and Ast rophysics (ASCA) provided temperature profiles to 
large radius for som e clusters fe.g.. lMarkevitch et al.ll9 96l:lFukazawall997l;lMarkevitch et al.l 



1998; 



2000 



White 



2000f), as well as Be ypoSAX (e.g., llrwirTetlrtJ Il999l ; llrwhi^TlBregmanr 



De Grandi fc Molendil liooj ). Due to their high particle backgroun ds, XMM- 



Newton and Chandra usually cannot robustly r e ach the cluster outskirts i 
200l |; IZhang et al.ll2004l; IVikhlinin et al]|2005l: lArnaud et al.l 120051: iKotov fc Vikhlinin. 
2005l;|Piffaretti et al.ll2005l ; IPratt et al.ll2007l ; ISnowden et al.ll2008l ; beccardi fc Molendil 




2008 bj), apart from a few special, e.g., v ery bright, low temperature, systems (e.g., 



Urban et al1l201ll : iBonamente et alj|2013h 



The breakthrough came recently aft er the launch of Su zaku, a satellite in low- 
Earth orbit and with short focal length ( Mitsuda et al.l [2007b . resulting in a low and 
stable particle background. The first Suzaku temperature meas urements reaching be - 
yond the XMM-New t on and Chandra lim it were publis h ed by iFuiita et alj (|2008al ) ; 
IReiprich et"ai] (|2009l ) : iGeorge et all (|2009h : IfBautz et all (12003). T he latter three are 
all based on relaxed cool-core clusters, while Fuiita et a l. (2008a) targeted the com- 
pressed and heated interaction region between A399 and A401 with the primary goal 
to constrain the metallicity (Section [SJ . While initially there were sometimes difficul- 
ties to properly account for all fore- and background components (see the technical 
Section 16.1.11 for details on these components), especially for clusters at low Galactic 
latitude IjGeorge et alj 120091 ; lEckert et al.l 120111). more e laborate robust analyses are 
now routinely being performed (e.g., Walker et al.ll2012al ). 

The ~100 refereed articles that are turned up by ADS^I when searching for "Suzaku" 
and "cluster" in the abstract have already received > 1,000 citations at the time of 
writing (December 2012), demonstrating the large interest in Suzaku cluster studies. 
The six most highly cited references of these all deal with cluster outskirts, which shows 
that this interest is driven particularly by this subject. 

In Fig. [7] all currently available Suzaku cluster temperat ure profiles are shown t hat 
reach beyond about 1/2 r2oo (to our knowledge; see also lAkamatsu et al.l l201l] for 
an earlier compilation) Q The purpose of this compilation is to test for similarities of 



e Astrophysics Data System, http://adsabs.harvard.edu/abstract_service.html. 

7 The data were thankfully provided electronically by H. Akamatsu (A2142, 
lAkamatsu et alj 120111, as well as A3667, A3376, CIZA2242. 8-5301 , and ZwC12341.1- 
0000. lAkamatsu fc Kawaharal 120131 ; A. Fabian and S. W alker (A2029, IWalker et all l2012d , 
and the reanalysis of PKS0745-191, IWalker et al.ll2012al. up dated from the inital results of 
Georee et aT1l2009D; A . Hoshino (A1413. lHoshino et a.l.ll2010f); P. Humphre y (RXJ1159+5531, 
Humphrey et alJl2012f) : M. Kawaharada" (A1689. | Kawaharada et alJl2010h : E. Miller (A1795, 
Bautz et al.ll2009f); T. Reiprich (A2204, IReiprich et alj 120091) ; K. Sato (A2811, A2804, and 
A2801, ISato et al.ll2O10h; T. Sato (Coma.lSato et alj|2011l . and Hydra A. ISato et aTll2012fi ; A. 
Simionescu (Perseus , | Simioncscu et al. 20111). Note that for several clusters more than one data 
set is shown, each covering a different azimuthal direction within a cluster. After acceptance 
of this review, two other Suzaku t emperature profile p apers appea red, which are not incl uded 
in the compilation above (A1835, Ichikawa ct al. 2013, and Coma, Simioncscu et al.H2013l) . 
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Fig. 7 All published Suzaku temperature profiles that reach cluster outskirts. The references 
are listed in the text. 



the temperature profiles and, if similar, to determine the general trend and compare to 
pre dicted profiles. Befor e proceeding however, a few words of caution are in order: While 
the lArnaud et al. (2005) (fcBT)-r2oo relation (for their hot cluster subsample, assuming 
a flat Universe with Hq = 71 km/s/Mpc and O m = 0.27) was used homogeneously for 
all clusters for the radial scaling, the shown data are inhomogeneous in several other 
aspects. These include, e.g., background subtraction, PSF correction, deprojection, 
(fegT) determination, and radial bin center calculation, so this will cause increased 
dispersion in the profiles. Also, both axes are not completely independent since both 
are scaled by (k-gT) {{k-QT} 1 ^ 2 for r2oo), which further increases dispersion (due to the 
negative slope of the profiles). So, even if the true scaled cluster temperature profiles 
were perfectly self-similar, we expect to see dispersion in Fig. [7] 

The vertical bars shown are the 68% confidence level statistical uncertainties. Some 
authors provided also total systematic uncertainties. Typically, in the outer parts, they 
are of roughly similar size as the statistical errors. The horizontal bars indicate the 
radial range used for accumulating the spectra. 

The authors were asked to flag their clusters or azimuthal directions as either "re- 
laxed" or "merging." In Fig.[8]both sets of profiles are compared (without error bars, for 
clarity). While there appears to be more scatter in the profiles of the clusters flagged 
as merging, this is mos tly due to A3376 and A2804. In the merging cluster A3376, 
lAkamatsu et alJ ( 2012bh clearly identified a shock front, which can be appreciated in 
Fig. [8] (right) as the profile with a maximum around (0.7-0.8) T2oo- A2804 is a group 
that lies between two hotter clusters ( Sato et al]|2010h , which seems to cause an untyp- 
ically flat temperature profile in the outer parts (second highest relative temperature 
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at ~0.8 r2oo)- Overall, the trends of the relaxed and merging profiles are rather similar 
and in Fig. [9] all profiles are therefore combined, excluding only those of A3376 and 
A2804. 




Fig. 8 Suzaku temperature profiles of clusters flagged as relaxed (left) and merging (right). 



In the inner region (<0.3 T2oo) the profiles appear flat. This is due to the large 
spread and radially varying slop es of central temperature profiles as observed with 
Chan dra and XMM-Newton (e.g JVikhlinin et al.ll2005l : IPratt et al.ll2007l : lHudson et all 
I2010I ) combined with Suzaku's broad PSF. Beyond this central region, temperatures 
systematically decline by a factor of about three out to T2qo and slightly beyond. 

The Suzaku average profile is compared to profiles predicted by JV-body plus hy- 
drodynamic simulat ions ( Roncarelli et al.l l200d ; iNagai et al.l l2007al ; iBurns et al.l l20ld ; 
IVazza et ai]|201ll) . employing different numerical algorithms and incorporating differ- 
ent physics (Fig. [9l right). Before proceeding, it needs to be stressed that this compar- 
ison should not be overinterpreted. This is, on the one hand, because the simulations 
vary in several aspects, e.g., they employ different temperature definitions. Moreover, 
all simulations have difficulties reproducing the temperature structure in cluster cores 
as observed by Chandra and XMM-Newton. Results in cores depend strongly on the 
additional physics put in. The point is now that the profiles in Fig.|9]are normalized by 
some average temperature ({k B T)), which depends also on th e temperature structure 
in t he core. We have, th erefore, increased {k B T) by 50% for the lRoncarelli et al.l ( 2006h 
and lVazza et al.l (|201ll ) profiles in order to roughly renormalize them to fit the cluster 
outskirts, as many authors find that employing different physics r ecipes does not a ffect 
much the ou ter temperature profiles. Without this rescaling, the lRoncarelli et al.l and 
IVazza et alj profiles would lie above the two other simulated profiles. 

In any case, the slopes of observed and simulated temperature profiles appear 
consistent in the outskirts; if anything, then the observed average temperature pro- 
file drops off slightly faster than the simulated ones at the largest radii, as already 
noted in the very fir st Suzaku temperature profile determination of a relaxed cluster 
( Reiprich et"aT]|2009l ). In Fig. [9] (right), this is illustrated by the dashed line, which 
gives the best linear fit to the data points in the range 0.3r2oo < r < 1.15r2oo as 
k B T/{k B T) = 1.19 - 0.84r/r 200 . 

It has already been mentioned that some authors provided more than one Suzaku 
temperature profile for a given cluster by subdividing the p rofile into azimutha l direc- 
tions, sometimes finding significant differences. For instance, iKawaharada et all (|2010h 
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Fig. 9 All Suzaku temperature profiles, excluding A3376 and A2804 (see text; left: individual 
profiles, right: binned average profile). 162 temperature measurements are binned into 18 radial 
bins, each containing 9 temperature measurements. Error bars denote the standard deviation 
of temperature measurements in each bin. Also shown are (partially renormalized, see text ) 
average tem p erature profiles from n umer ic ally simulated clust ers from lRo ncarclli ct al. (2006); 
iNagai et aD (|2007af) : iBurns et al] J2010f) : IVazza et all d201lTl . The dashed line represents a 
simple linear fit to the data points beyond 0.3r2oo- 



found that in the northeastern direction of Abell 1689 the temperature around the virial 
radius is about three times larger than temperatures in other directions. Moreover, they 
found this enhanced temperature to be correlated with a large-scale structure filament 
in the galaxy distribution and argue that thermalization is faster in this overdense 
infall region. 

So, with Suzaku we have been moving forward, temperature measurements out to 
~r2oo can De performed and we expect more progress in the next few years through 
homoge neous sample studies w ith Suzaku and the upcoming eROSITA and Astro- 
H (e.g.. iTakahashi et al.ll2012l) instruments, the latter satellite also c arrying a high 
spectral resolution micro-calorimeter array (e.g., iMitsuda et al]|2012l ). Nevertheless, 
we are still quite far away from the outer border of cluster outskirts according to our 
definition (3r2oo, Section [2j|- New X-ray missions with low particle background, low 
soft proton contamination, good PSF for AGN removal, large field-of-view, and large 
effective area will likely be required to reach this frontier. 

There is another route to constraining temperature profiles in cluster outskirts: the 
combination of X -ray surface brightness measurements with SZ decrement profiles (e.g., 
iBasu et al.ll201ol ). This will be discussed in more detail in Section r6.2l Naively speaking, 
the SZ effect depends on the gas density and the X-ray emission on the square of the 
density, therefore, the SZ signal might trace the low density cluster regions better 
than the X-ray signal. However, the actual situation is slightly less straightforward 
since the SZ signal also decreases linearly with decreasing temperature; i.e., in cluster 
outskirts (Fig. [5J) while the soft X-ray emission increases with decreasing temperature 
for k^Tc < 2 keV due to line emission (Fig. [2]). So, overall, the relative gain in sensitivity 
of SZ measurements compared to X-ray measurements in cluster outskirts is not as 
strong as naively implied by the comparison of density dependencies. 

4-1.3 Pressure profiles 

As the main balance for gravitation in massive halos, the distribution of thermal pres- 
sure within the ICM is of particular interest. It has been investigated via X-ray ob- 
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servations. Work based on XMM-Newt on observations of the REXCESS samp le, a 
representative sample of nearby clusters i Bohringer et al.ll2007l : IPratt et al.ll2009h . has 
demonstrated that the scaled distribution of the ICM pressure follows a universal shape. 
The observational constraints extend out to a radius of r^oQ. Within this radial range, 
the observed pr essure profiles are well-matched by predictions from various numer i- 
cal simulations (jBorgani et~ai1l2004l : lNagai et al.ll2007al: IPiffaretti fc ValdarninilEooi 'l. 
Without observational constraint beyond r§oo, lArnaud et alj (|20ld ) provided a simple 
"universal pressure profile" parametrisation of a generalized NFW (GNFW) function 
from the XMM-Newton data and the aforementioned numerical si mulation ( b eyond 
r 50o)- This work has been extend ed down to the group r egime bv ISun et al . (2011) 



gr, 

from Chandra data; see also, e.g.. iFinoguenov et alj (|2007l l for earlier work based on 
XMM-Newton. 

Baryons in the outskirts bear the signature of the continuous three-dimensional 
non-spherical accretion from surrounding filaments. In this sense, access to the level of 
thermal pressure in the cluster outer parts provides a neat way to assess the virialization 
degree achieved, the t hermodynamical state of the (pre-shocked) in-falling material 
( Voit et alJl2002 . l2003h . etc. 

Observational constraints on cluster outskirts are sparse and difficult to gather, 
although these regions encompass most of the cluster volume. X-ray observations have 
recently provided a first insight out to ~r2oo of the physical properties of the gas 
(Sections 14X11 and l4~l~2j) . 

The SZ effect has the potential to contribute greatly to the discussion on cluster 
outskirts due to its linear dependence on density and temperature. The radial pressure 
distributions of the first SZ cluster samples have recently been presented based on 
observations with, e.g., the S Z Array / Combined Array for Research in Millim eter-wave 
Ast ronomy (SZA/CA RMA, iMroczkowski et aDl2009l : iBonamente et al.ll2012i) APEX- 
SZ ijBasu et alJl2O10h . and the South Pole Telescope (SPT. IPlagge et al.ll201oh . These 
studies confirmed that the ICM properties as seen by SZ and X-ray observations are 
consistent at least out to r^QQ. Noticeably, IPlagge et al] fcoich have obtained from 11 
SPT clusters a stacked SZ profile out to ~(1.5-2)r;,nn i where the shape of t he underlying 
pressure profile is compatible with the one given bv Urnaud et al l d2010h. 

A significant set of results has recently been published by the Planck Collaboration et al] 
( 20131 ) based on the Planck nominal survey (i. e., 14 months of survey). Making u se of 
its full sky coverage over nine frequency bands ( Planck Collaboration et al]|2011al ) , the 
Planck satellite maps all cluster scales from its native resolution (5 to 10 arcmin at the 
SZ relevant frequencies) to their outermost radii, even for nearby clusters. The Planck 
Collaboration adopted a statistical approach to derive a stacked SZ profile from a 
sample of 62 nearby clusters detected in SZ with high significance. These clusters were 
selected from the Planck Early SZ (ESZ) sample and were p reviously used to investigate 
the to tal integrated SZ flux and the SZ scaling relations |Planck Collaboration et al] 
l201ld ). 

The statistical detection of the SZ signal extending out to 3rsoo provides the first 
stringent observational constraints on the ICM pressure out to a density contrast of 
S ~ 50-100. Correcting for the instrument PSF and deprojecting the 2D profile into 
a 3D one, the Planck collaboration derived the underlying thermal pressure profile 
of the ICM. This observed pressure profile is in excellent agreement with the one 
derived from XMM-Newton archive data for all 62 clusters, over the overlapping ra- 
dial range of (0.1-l)r5oo- The combined SZ and X-ray pressure profile gives for the 
first time a comprehensive measurement of the distribution of thermal pressure sup- 
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Fig. 10 Both figures and caption reproduce content from Planck Collaboration ct al. (2013), 
reprinted with permission. Refer to this paper for a description of the f(M) factors. Left: Planck 
pressure profile obtained from the average of the individual pressure profiles across the sample 
(red points) shown together with the stacked pressure profile derived from the XMM-Newton 
data for the same sample (purple points). The dispersions about the SZ and X-ray profiles are 
depicted, respectively, by the r ed and purple shaded areas. The best-fit GNFW profile is shown 
as a solid black line and that of Arnaud et al. (2Q1CJ) as a solid green line. The lower panel shows 
the x profile of these two best-fit models taking into account the statistical errors and dispersion 
about the observed profile. Right: Measured pressure profiles from Planck and XMM-Newton. 
The three shaded areas mark the dispersions about the average pr o files for three sampl es of 
simulated clusters: in blue, th e simulations from Borgani ct al. (2004), Nagai ct al. (2007aj) and 
Piff aretti fe Valdarninll IMPOST) , which were used in lArnaud" et al .1 Il2010r7 to derive the universal 
pressure profile together with the R EXCESS data; in green, the simu lated sample of c luster s 
from Dolag et al. (in prep.); and in brown the simulated clusters from lBattaglia eF al. (2012). 
The corresponding average profiles for each set of simulations are plotted as solid lines using 
the same color scheme. In the bottom panel, the \ profiles between the observed profile and 
the simulated average profiles are presented, taking into account their associated dispersion. 



port in clusters from O.Olrsoo out to 3rsoo. Similarly to lArnaud et al ] (|2O10h . the 
Planck Collaboration has derived an anal ytical representation assuming a GNFW pro- 
file (as formulated bv lNagai et alj|2007al ) with best-fit parameters [Pq, C500, 7, a, f3] — 
[6.41,1.81,0.31,1.33,4.13] (Fig. the outer slope (/3) being slightly shallow er than 
the extrapolation based on simulations that was used by I Arnaud et al ] (|2010h . Weak 
hints for even flat ter outer slopes hav e been found recently with Bolocam data of a sam- 
ple of 45 clusters ( Savers et al.ll20l3 ) . In the in ner parts, there seem s to be some tension 



between the XMM-Newton data points and the lArnaud et al.M2010r) XMM- Newton pro- 
file (although well within the dispersion). As discussed in Planck Collaboration et al. 
(2013), his is likely caused by differences in selection of the two samples (i.e., fraction 



of dynamically perturbed versus relaxed clusters). 

The observed joint Planck and XMM-Newton pressure profile is also in agreement 
within errors a nd dispersion over the whole radial range with various sets of sim- 
ulated clusters jBoreani et al.ll20o3 : iNaeai et ai1l2007al : IPiffaretti fc Valdarnin] 120081 : 
iBattaglia et al.ll2010l : Dolag et al. in prep.). Within the spread of predictions it matches 
best the numerical simulations that implement AGN feedback, and it presents a slightly 
flatter profile compared to most of the above theoretical expectations in the outerparts. 
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As outlined in Section [T] the physics at play in cluster outskirts is still to be 
understood. The constraints on gas pressure brought from the Planck stacked SZ mea- 
surements have shed light over a volume almost an order of magnitude larger than 
that accessible from X-ray data of individual clusters alone. Further SZ and X-ray 
measurements, especially for complete samples that are not subject to possible archive 
biases, will continue to provide an observational insight of cluster outskirts and serious 
constraints for theoretical studies on issues such as gas clumping, departures from hy- 
drostatic equilibrium, contribution from non-thermal pressure, etc., as will be discussed 
in the following several Sections. SZ and X-ray measurements complement each other 
nicely in this sense since they primarily constrain different physical properties of the 
gas (SZ: pressure, X-ray: density and temperature). 

4-1-4 Entropy profiles 




Fig. 11 Left: Scaled entropy profiles measured with Suzaku, apart from A1835 and A2204 
(Chandra, bl ack boxes and triangl es, respectively) and Virgo (XMM- Newton, green crosses); 
compiled by IWalker et alj d2012bl . reproduced with permission). The black lines show the 
entropy profile and scatter as derived by combining the Planck average pressure profile (Tig. llOt 
with the ROSAT average density profile (Fig. [BJ . The straight line represents a n r 1 1 power- 
law n ormalized to 1 at 0.3r200- Right: Scaled entropy profiles determined by lEckert et alj 
(2013b, reproduced with permission) . The strai ght dotted line again has a slope of r 1 1 but 
is normalized using the definition of Voit (2005). For the green band and the red triangles, 
the same data (average Planck and ROSAT profiles) as on the left are used. The two symbols 
represent different analysis methods (density determination through fitting a function to the 
emission measure profile and through direct deprojection, respectively). Also shown (circles) 
are the results from determining individual entropy profiles for 18 clusters in common between 
the Planck and ROSAT samples. 



Often, "entropy" is defined in this field as K = k^T gas /n e ; i.e., the discussion 
can be kept short here since it can basically be derived by combining Section 14.1.11 
(density profiles) either with Section H.l. 21 (temperature profiles) or with Section T4.1.3I 

5 /3 

(pressure profiles, through K — P c /n c )• Moreover, several aspects of entropy have 
already been addressed in Section [4. 1.1 1 

Typically, the temperature profiles observed with Suzaku in cluster outskirts tend to 
be fairly steep while the density profiles are often found to be less steep than expected 
(in particular for Suzaku). This results in a flatter slope compared to the expected 
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entropy profile gradient K oc r 11 (e.g..lVoitll2005h but no te also exceptions to this, e.g., 
the fossil group RXJ1159+5531 (|Humphrev et al-lboij - ). This is illustrated in Fig. [TT] 
where the Suzaku data on the left show a clear drop off at large radii while this is much 
less obvious from the results on the right, possibly due to systematic differences between 
ROSAT and Suzaku density profiles, different chosen normalizations for the r power- 
law, or differences in sam ple selection. Addit ionally, a possi ble difference in the way how 
the v ery same ROSAT (|Eckert et alj |2012( ) and Planck |Planck Collaboration et al.1 
120131 ) data are combined in both works is suggested by the presence (absence) of 
non- monotonicity in the a verage profiles on the left (right). Interpretations also vary; 
while lEckert et al.1 (|2013bl ) argue tha t the well-known cent ral entropy excess extends 
out further than previously thought, IWalker et al.l l|2012bh suggest that clumping in 
cluster outskirts could be one (but not the only) important effect. Since this review 
is concerned with cluster outskirts, we discuss clumping and several other possible 
physical and technical explanations in upcoming Sections, mostly in terms of effects 
on temperature and density measurements because the primary focus of this review is 
on mass profiles in cluster outskirts. 



4-1.5 Gas/baryon mass fraction 

The gas fraction, f s = M sas /Mtot, in clusters typically increases as a function of both 
mass and radius (e.g., IVikhlinin et al.l 120061 : see Section 7.4 in IReiprichl l200ll for a 
discussion of pre- XMM-Newton and - Chandra results). Going out far enough into the 
outskirts, the cosmic mean baryon fraction, e.g., as determined from measurements of 
the cosmic microwave background radiation or the primordial deuterium-to-hydrogen- 
ratio, should eventually be recovered. It would be interesting to measure the charac- 
teristic radius at which this usually happens because this would help constrain the 
physical processes relevant for depleting the inner cluster regions of baryons. More- 
over, cosmological applications involving the gas or baryon fraction of clusters rely on 
precise estimates of the gas/baryon depletion factor at a given radius and redshift (e.g., 
lEttori et al.ll2009l ; I Allen et al.1 [20 III ). 

As discussed in Section [47T7T] ROSAT and Chandra data as well as hydrodynamic 
simulations mostly indicate a steepening of the gas density profile with increasing radius 
up to ~r5oo and beyond, while even further out, Suzaku data seem to favor a flattening. 
Taken at face value, gas mass fractions in excess of the cosmic mean baryon fraction 
are sometimes implied. On the other hand, many different physical effects, considered 
in the following Sections, could result in an artificial trend by affecting either the gas 
mass or total mass determination or both. Also, some challenges for measurements are 
outlined in the technical Section [6. II 

The R OSAT PSPC was great for measuring gas density profiles out to very large 
radius (e.g.. lEckert et al.ll20I2l . l20I3al ) because o f the very low particle background level 
and large field-of-view. For instance, IReiprichl l|200lh measured the gas mass fraction 



within r2oo for 106 clusters. For 58 out of these clusters, only a small or no extrapo- 
lation of the measured surface brightness profile was necessary. The resulting / gas ,200 
histogram is shown in Fig. If 21 One notes that the Perseus cluster, a prominent ex- 
ample for a high Suzaku gas mass fraction and discussed in terms of gas clumping 
in Section 14.1 .1 1 has one of the highest observed gas mass fractions of all 58 clusters 
(/gas, 200 — 0.21, in excellent agreement with the Suzaku measurement). While the sta- 
tistical and systematic uncertainties of the total mass measurements are large, resulting 
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in a broadening and possibly a shift of the distribution in Fig. 1121 this could be an in- 
dication that the Perseus cluster just happens to lie on the extreme tail of the intrinsic 
cluster /gas, 200 distribution. Note that while t he intrinsic dispe rsion of the / ga s,2500 
distribution is small (for relaxed clusters, e.g.. lAllen et al.ll201lf ) it may be larger for 
/gas, 200 (also considering the whole cluster population). Therefore, more Suzaku (and 
Astro-H) and SZ observations of a complete sample of clusters may be required for a 
full understanding of the typical baryon fraction and possible gas clumping in cluster 
outskirts. 




0.05 0.10 0.15 0.20 

f gas, 200 

Fig. 12 Gas m ass fractions within r2oo based on ROSAT PSPC data. Values taken from 
Rciprich (2001) and converted to Ho = 70 km/s/Mpc. Only those 58 clusters are shown 
for which significant cluster emission has been detected out to r > 0.8r2oo- The gas mass 
fraction of the Perseus cluster - in excellent agreement with the recent Suzaku measurement 
(Simioncscu ct al. 2011) - is marked with an arrow. WM AP9 data indicate a cosmic mean 
baryon fraction of around (16-17)% llHinshaw et al.ll2012T) . 



4.2 Structure formation in action 



Structure formation simulations show that gal axy clusters grow throu g h mergers and 
infall of matter clumps along filaments (e.g., iBorgani fc Guzz ol |200l|; [Springcl et_al 



2005). Observational evidence of the former is widespread fe-g-. TMarkeTrtch^Tvikliliriin 



2007). Filamentary structures have also been seen for decades, e.g., in the galaxy and 



galaxy cluster distribution (e.g., Fig. 1 13 pi - The expected filamentary gas distribution 
between clust ers; i.e., the WHIM, l ikely containing a signific ant fraction of all local 
baryons (e.g.. lCen fc Ostrikerlll999l ; iFukugita fc Peeblesll2004|). still evades a very sig- 
nificant robust detection in X-rays (e.g., Kaastra et a l. 2003; B regman fc Llovd-Daviea 
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Fig. 13 Shown as yello w circles are ~100 X-ray bright nearby galaxy clusters 
(Rciprich & Bohringcr 2002). The circle radii indicate the apparent T200- In grey, the distribu- 
tion of ~0.5 million near-infrared selected local galaxies from 2MASS is shown djarrett et alj 
2000). Both populations trace the same filamentary structure and superclusters. The reason 
one can see the same structures with a few X-ray galaxy clusters as with a much larger num- 
ber of galaxies is that clusters are strongly biased tracers of the matter distrib ution; they are 
the lighthouses of structure in our Universe. Reprinted with permission from Rciprich ct al. 
J2003f) . 



20061 : iNicastro et alJ 20051: 1 Williams etliDl2006l : iRasmussen et al.ll2007l : iKaastra et aD 
2009 : Bregmanl 120071 : iBuote et all 120091 ). Significant progress has bee n made at UV 



wavele ngths (tracing the lower temperature phase of the WH IM) ; see K aastra et "all 



diooi ) for detailed review articles on the WHIM, in particular iRichter et al.l J2OO8) 



We are currently entering an era when the region between the well-observed cluster 
centers (< r^oo) an d the elusive WHIM (> 3r2oo) comes within reach of X-ray and SZ 
measurements (Section I4.1| l. This is a region where a lot of action related to structure 
formation is expected to be happening. For instance, Fig. [1] shows infalling clumps of 
matter. Typically, these higher density regions are predicted to have a cooler temper- 
ature than their surroundings. Observational confirmation is now needed to test this 
picture in detail. On the other hand, if these clumps are generally present but remain 
undetected, e.g., due to poor spatial resolution, they will bias the X-ray gas density 
and temperature measurements in the outskirts (this is described in Sections 14.1.11 
and 14. 6p and, unless they are in pressure equilibrium with the ambient gas, they will 
also bias interpretations of SZ measurements that assume a smooth distribution. So, 
quantifying di fferences of X-ray and SZ results may allow us to const rain unresolved 
clumping fe.g.. lGrego et al.ll200ll : Ijia et alJliooi : iKhedekar et alJlioij ). 

More action in outskirts related to structure formation includes, e.g., large Mach 
number accretion shocks and corresponding particle acceleration. While these phe- 
nomena may be traced throu gh the non-thermal pa rticle population in the radio, hard 
X-ray, and 7-ray bands (e.g., Pfrommer et alJ lipOS). also soft X-ray and SZ measure- 



ments are required for a full understanding of the overall plasma properties. 
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4.3 Hydrostatic equilibrium 

In the following Sections, we will discuss a few (non-) equilibrium situations. For the 
X-ray and SZ mass determination, it is perhaps most straightforward to see that hy- 
drostatic equilibrium is important since it is the basis for eq. ([6j). 

For major cluster mergers, it is obvious that the assumption of hydrostatic equi- 
librium is not a good one. Moreover, simulations suggest that also in less disturbed 
clusters, turbulence and bulk motions, e.g., due to infalling clumps, may disrupt hy- 
drostatic equilibrium at some level fe.g.. lNagai et al.ll2007bh . This appears to be more 
significant the f urther out one goes in terms of fraction of overd ensity radius (e.g., 
lLau et al.|[2009l ; lMeneghetti et al.ll201fj ). See also lSuto et all (|2013T l for an alternative 
interpretation of hydrostatic equilibrium biases as acceleration term in the Euler equa- 
tions. 

Observationally, direct measurements of/upper limits on ICM turbulent velocites 
and bulk motions a re currently limited to cluster cores or to merging subclusters (e.g., 
Sanders et al. 2011. using the XMM-Newton Reflection G rating Spectrometer (RGS); 



Sato et al. I2008I -1 bugawara et al.ll2009l ; lTamura et al.ll201ll . using Suzaku CCDs). Claims 



of an observational detection of su ch motions in the Centaurus clu s ter ha ve been made 
using ASCA an d Chandra data (|Dupke fc Bregmanl l200ll . 120051 . I2006T ) and refuted 



( Ota et aTll2007r ) using Suzaku data. A robust direct measurement of turbulence may 



need to await future X-ray missions, like th e upcoming Astro-H ca rrying high spec- 
tral resolution micro-calorimeter arrays (e.g.. lZhuravleva et al.ll2012l '). To reach cluster 
outskirts, even larger effective are as will be required, possibly provided through an 
envisaged Athena-like mission (e.g.. iBarcons et al]|201 j ; lNevalainenll2013l ). 

In cluster centers, gas "sloshing" is widespread, resulting in spira l-like patterns in 
the surface brightness distribution fe.g.. lMarkevitch fe Vik hlinin 2007, for a review). It 
has been suggested based on ROSAT, XMM-Newton, and Suzaku observations of the 
Perseus cluster that features produced by such motio ns could extend even out to radii 
approaching the virial radius ( Simionescu et al"1l2012r i. 



Another highly sensitive probe of the physical state of the ICM in cluster outskirts, 
in particular turbulent pressure support in high-z (z > 0.5) objects, can be through the 
Sunyaev-Zel'dovich effect angular pow er spectrum, which is the integrated signal from 



all th e unresolved clusters in the sky ( Komatsu fc Kitavama|[l999l ; iKomatsu fc Seliakl 



2002). The SZ power spectrum is measured as a foreground component of the CMB 
signal with the same spectral dependence as the SZ effect. Half of the SZ power comes 
from low-mass clusters and groups ( M200 < 2 x 10 14 M(7)), an d half of it also comes 



^K_ana 

from high-redshift (z > 0.5) systems (|Komatsu fc Seliakl 120021 ) . Therefore, SZ power 



might provide the only method to study these otherwise un-observable systems with 
low mass and high redshift, although only through their summed contributions. 

Unlike X-ray brightness, the integrated SZ signal of a cluster carries a significant 
weight from the volume outside of rsoo, so the prediction of SZ power is strongly 
dependent on the pressure profile in the outskirts. The difficulty lies in detecting 
the SZ power itself, which is not the dominant source of foreground anisotropy at 
any frequency or angular scale. Recent SPT measurements have constrained the tSZ 



power at a low amplitude: 3.5 ± 1.0 /jK 2 at I = 3000 l|Shirokoff et al.ll201iri. This is 



lower than the prediction from X-ray derived cluster pressure models (IShaw et al 1 l2010l : 
lEfstathiou fc Migliaccioll2012h . but the difference can be the result of many different 
effects, like pressure support from gas bulk motions, AGN and star-formation feedback, 
or other non self-similar evolution of the ICM. Future CMB measurements with better 
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frequency coverage and angular resolutions are expected to place a tighter constraint 
on the thermal SZ power and break some of these degeneracies. The cosmology de- 
pendence of the SZ power can also be nailed down by other methods, for example an 
accurate value of os will reduce its degeneracy with the uncertain gas physics. 

Another possible, indirect route to determining how strongly X-ray and SZ hydro- 
static mass determinations are affected by turbulence, bulk motions, and other gas 
physics effects is through comparison to weak lensing measurements, which do not rely 
on the assumption of hydrostatic equilibrium, in principle. This has been done exten- 
sively for the inner parts of clusters (<r5oo, e.g., Ettori et al. and Hoekstra et al., this 
volume, for reviews) and it may be feasible now also for cluster outskirts, although also 
the weak lensing mass reconstruction accuracy is more limited in the outer parts (e.g., 
Section lOTT) . 

In addition to merger- or accretion-induced bulk motions, also convection may occur 
in the ICM. While a gas temperature gradient in itself does not imply a violation 
of hydrostatic equilibrium, convection should occur if the specifi c entropy decreases 
significantly with in creasing radius (for a non-magnetic ICM, e.g.. lLandau fc Lifschitj 
Il99ll : ISarazinlll986l . Sections 4 and V.D.6, respectively); i.e., if 

£<«• < 9 » 

With 

socln^ (10) 

Pgas 



and 7 = 5/3 this condition becomes 

d In Tgas 2 d In p gas 



(11) 



dr 3 dr 

Typically, density gradients in clusters are much steeper than temperature gradients; 
so, generally no convection is expected. However, in cluster outskirts, there are some 
indications that density profiles may flatten (Section 14. 1 . 1 1> . temperature profiles may 
steepen fSection l4.1.2[) , and entropy profiles may turn over (S ectionl4. 1.411. For instance , 
convectional instability was considered early on for A2163 bv lMarkevitch et al ] (|l996h . 
In the presence of magnetic fields, however, the situation may be more comple x due to 
other possible instabilities, in particular the magnetother mal instability ( MTI, iBalbud 
I2000T ) and the heat-fiux-driven buoyancy instability (HBI. lOu atacrt 200a). 



4.4 Thermal equilibrium, T e = T gas ? 

With several tens of millions of degrees the ICM is hot. An important heating mecha- 
nism is shock heating, either in cluster mergers or when infalling gas passes through an 
accretion shock. The ICM consists of electrons and ions. Most of the ions are protons, 
so for simplicity we just use the term protons now. In an accretion shock, primarily 
protons should be heated since they carry most of the kinetic energy. After leaving the 
shock front, the time scales for protons and electrons to reach Maxwellian velocity dis- 
tributions arc short; i.e., they will both settl e into therma l equilibrium quickly, albeit 
at different temperatures. The time scale is (|Spitzerll 19561 . eq. 5-26) 



tcq (x, 



1.13X10 yrZ x ^— j (^^^ j U ^3 cm - 3 J 



(12) 
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where Z x is the particle species charge, m x its mass, T x its temperature after reaching 
equilibrium, n x its number density, and lnyl x its Coulomb logarithm. So, in typical 
cluster regions, this is about t e q(p, p) ~ 10 Myr for protons and for electrons about a 
factor of 43 faster, 



teq(e,e) « 



II I c 



1/2 



ioq(p,p) « 2.64xl0 5 yr 



fcg?e 
lOkeV 



n c 



In A 

^10" 



(13) 

If the electrons are indeed heated less efficiently in the shock than the protons, resulting 
in T < T p after equilibration, they will reach equilibrium even faster than implied by 
the square root of their mass ratios. 

What takes longer then is raising the electron temperature to the proton tem- 
perature. Assuming this process proceeds primarily through Coulomb collisions, the 
corresponding equipartition time scale is very roughly a factor of 43 longer than the 
proton equilibration timescale, 



We,p) : 



-1/2 



*eq(p,p) , 



(14) 



and is given by ( Spitzer|[l956l . eq. 5-31) 



t cq (e,p) -2.51 x 10 s yr 



lOkeV 



10- 



ln A: 
40 



1 + 



Tp m G 
T c m p 



(15) 

The last two factors are close to 1, so they are ignored in the following and we also 
assume n c « 1.2n p . In cluster outskirts, both ICM densities and temperatures are 
significantly lower than in the better observed inner regions. Since the densities drop 
much more quickly, the net effect is an increase of the equipartition timescale. It can 
come close to one Gyr or even more, 



We,p) 



10 yr 



fcgje 
IkeV 



He 



10- 



(16) 



So, there the X-ray-emitting electrons may have a cooler temperature than the invisible 
protons, resulting in a steeper X-ray temperature gradient towards the outskirts and, 
therefore, an underestimate of the total mass (Fig. [3j|. 



skirts/WHIM has been studied theoretically by many authors (e.g., Fox & Loebl 
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2010|). It 



is worth mentioning that the situation may differ between merger shocks (small Mach 
numbers) and accretion shocks (large Mach numbers): the electron heating efficiency 
(relative to t he one for protons) in a shock may be anticorrelated to the Mach num- 
ber squared I Ghavamian et all 120071 ) if other processes in addition to pure Coloumb 
heating are considered, resulting in fast relative electron heating in merger shocks and 
slow heating in accretion shocks. Indeed, for the textbook merger shock in the bullet 
cluster, the equipartition time scale seems about a factor of five shorter than implied 
by eq. 1)1511 (Markevitch & Vikhlinin 2007) assuming the Chandra lower temperature 
limits from iMarkevitc h (2006) for the extremely hot post-shock gas to be robust against 
systematic calibration uncertainties. 
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Observational results have also been obtained for cluster outskirt meas urements. 
For instance, the early ASCA work on Abell 2163 bv lMarkevitch et al ] (|l996h triggered 
some of the theoretical studies mentioned above. More recently, Suzaku data have been 
used to constrain the equipartition state (e.g., iHoshino et al.l lipid ; lAkamatsu et al.l 
1201 lh . With current instruments, no strong direct constraints have been achieved, 
though. 

Fig. 2 of iRudd fc Nagail {2009) implies that for unrelaxed and relaxed clusters of 
similar temperature, the electron temperature in the outskirts deviates more strongly 
from the ion temperature for unrelaxed clusters. Naively, one might conclude from the 
similarity of Suzaku profiles (Fig. [8]) that no such trend is observed; i.e., no strong 
evidence for significant non-equipartition. However, a proper comparison would need 
to account for the possible difference of intrinsic temperature profiles between relaxed 
und unrelaxed clusters, for the inhomogeneous definitions used to classify clusters as 
un-/relaxed, and, since the equipartition timescale depends on temperature (ea. I16|) . 
for the possible difference in the temperature distributions of the subsamples. 

Since there are not only protons but also ions in the ICM, some of which are 
strong X-ray line emitters, there is hope to measure the ion temperature directly 
from the line width. This might be feasible with upcoming missions. For instance, 
the micro-calorimeter array aboard Astro-H will have an energy resolution of <7 eV 
(e.g., [Takahashi et al. 2 0121). poss ibly just sufficient to detect a thermal broadening of 



-5 eV (e.g.. iRebusco et al.ll200a . in the absence of other more dominant line broad- 
ening effects like bulk motions and turbulence) but only in the X-ray bright central 
regions where the collisional equilibration time scale is shorter than the typical time 
since the last major merger in any case. An Athena-like mission could have enough ef- 
fective area and energy resolution to measure the thermal broadening in nearby cluster 
outskirts, where the ion and electron temperatures possibly deviate. 



4.5 Ionization equilibrium, Tx = T c ? 

A very rough timescale for an astrophysical plasma to reach collisional ionization equi- 
librium (CIE) is given by 

iio„-e q -10 12 s(-^3) _1 (17) 



with some dependence on the considered element and temperature ( Smith fc Hughesl 
2010). Most ions in the ICM are not in an excited state because radiative deexcitation 
is very fast. 

Recasting (|17l) into units typical of cluster outskirts yields 



^ion— cq 



Therefore, on the order of a Gyr may be required in cluster outskirts - long enough to 
possibly result in an important, measureable effect for some clusters. 

For a given temperature, how do plasma spectra differ depending on the ionization 
state? Fig. [14] shows that stronger low energy emission lines are present when the ICM 
has not yet achieved CIE (r < 10 12 s/cm 3 ). In typical observed spectra with CCD-type 
energy resolution, this results in a shift of the peak position of the ~1 keV emission 
line complex towards lower energies, especially for a low temperature plasma (Fig. ll5[) . 
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Fig. 14 Model spectra of 1 keV (left) and 5 keV (right) plasmas for different values of the 
ionization timcscalc. Black: r = 8 X 10 10 s/cm 3 , red: r = 3 X 10 11 s/cm 3 , green: r = 8 X fO 11 
s/cm 3 , and blue: r = 3 X 10 12 s/cm 3 . The normalizations are scaled for visualization purposes. 
Other parameter values are TVh = 3 x 10 20 cm -2 , metal abundance = 0.2 solar, redshift = 0.1. 

This indicates that one might obtain a biased temperature if one wrongly assumes 
CIE in the fitting process. Due to this shift, as illustrated in Fig. 1171 a bias towards 
lower temperatures may then be expected. Indeed, this is the case as Fig. \W\ (left) 
demonstrates. In the most extreme non-equilibrium ionization (NIE) cases simulated, 
temperatures are underestimated by a factor of ~2. Since this bias, if present, will 



data and folded model 
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Fig. 15 Simulated Suzaku observations of a single-temperature ICM (k^T = 1 keV) that is 
not fully in collisional ionization equilibrium (2,000—3,000 source photons). Black: r = 8 X 10 10 
s/cm 3 , red: r = 3 X 10 11 s/cm 3 , and green: r = 3 X 10 12 s/cm 3 . 

likely be larger for larger radius where the density is lower, it will translate into a too 
steep temperature profile, resulting in an underestimated total mass (Fig. [3}. 

In Section[5l the importance of cluster chemistry especially in outskirts is described. 
Here, we show in Fig. [16] (right) how the metal abundance determination is biased if 
CIE is assumed when it is not yet established. While there is no strong bias for low 
temperature clusters, the metallicity gets severely overestimated for hot clusters in the 
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Fig. 16 Loft: Best-fit temperatures when incorrectly assuming collisional ionization equilib- 
rium as a function of t; for simulated Suzaku data, assuming temperatures of 1 keV and 5 keV. 
Other parameter values as in Fig, 1141 Right: Metallicity bias under the same assumptions. 



most extreme case (t = 8x 10 10 s/cm 3 ). This is due to enhanced line emission in the 
NIE case. 

Non-equilibrium ionization effects in low density plasmas have been studied theoret- 
ically in detail by sever al works (e.g.. lYoshikawa fc Sasakill2006l ; I Akahori fc Yoshikawal 
l201oi : IWong et alJl201lh. Some authors have also tried to estimate these effects obser- 
tionallv (e.g.. iFuiita et alJ l2008bl : iFinoguenov et all l20ld : lAkamatsu et alJ l2012al l . 



Future high-spectral resolution instruments like the upcoming Astro-H or an envis- 
aged Athena-like mission may be able to set tight constraints on ionization states in 
the outskirts of galaxy clusters and thereby constrain merger timescales. 



4.6 How to disentangle multitemperature structure, Tx = T c 



The widespread pre sence of temperature gr adients in central (e.g ., Allen et al.l 



200ll ; 

iHudson et all 120101 ) and intermediate (e.g.. IVikhlinin et al.ll2005l : IPratt et all 120071 ) 



cluster regions, as well as in cluster outskirts (Section I4.1.2p shows that the ICM is 
not isothermal. Moreover, gas temperature map s show that even at a given radius, a 
wide distribution of temperatures can exist (e .g. jReiprich et aDl2004l : iMillion fc Allenl 
2009; Randall ct al. 120101 : lLovisari et alll201ll ) , possibly depending on dynamical state 
fe.g.. lZhang et al.l l2009). Additionally, in cluster outskirts, emission from new matter 
infalling along filaments and possible cooler clumps might become more important 

(Fig. sj . 

Therefore, in a given spectral extraction region (say, an annulus in cluster out- 
skirts), emission from gas at multiple temperatures may be present. The data quality 
(e.g., number of source photons, signal-to-noise ratio, energy resolution) on the other 
hand may not be sufficient to constrain a multitemperature model. A single temper- 
ature model will then have to be fitted to the multitemperature spectrum. What will 
the best-fit temperature be? It has been s hown that the answer depends on the used 
X-ray mirror /filter /detector system (e .g., iMathiesen fc Evrardl l200ll ; iMazzotta et al.l 
120041 : iRasia et aljEooil : IVikhlininlboOrj ). This is easy to understand: the electron tem- 



perature, T e , is usually estimated from fitting a model, convolved with the instrumental 
response, to an observed spectrum. The most temperature-sensitive features in an ob- 
served spectrum with CCD-like energy resolution are (recall Fig. [2J the exponential 
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bremsstrahlung cutoff at high energies, the slope of the bremsstrahlung emission at 
intermediate energies, and the location of the emission line complex at low energies 
(Fig. I17p . An instrument with more effective area at low energies compared to high 
energies - relative to another instrument - is more sensitive to the low energy features 
and, therefore, will typically give rise to a lower single temperature estimated 



k B T = (0.3, 0.5, 0.75, 1.0, 1.5, 2.0, 3.0) keV 




Energy [keV] 



Fig. 17 The low-energy emission line complex (predominantly Fe and O) as a function of 
the intracluster electron temperature assuming a metal abundance of 0.3 solar, a neutral 
hydrogen column density of Nu = 3 X 10 2t) cm -2 , and a redshift of z = 0.05. The energy 
resolution approximates that of current X-ray CCDs. All spectra assume the same electron 
density distribution. Notice how the emission peak shifts to higher energies for increasing 
temperatures, from orange (0.3 keV) to black (3 keV). 



An illustration for the simple case of a two-temperature plasma is shown in Fig. [TH] 
for Chandra, XMM-Newton, Suzaku, and eROSITA. Spectral data are simulated for 
these instruments assuming emission from plasma at two different temperatures (0.5 
keV and 8 keV), with varying contributions (emission measure ratios - the x-axis) and 
assuming three different metallicities (0.3, 0.5, and 1 times solar) for the cooler compo- 
nent (the hotter component is always assumed to have a metallicity of 0.3 solar). Then 
single temperature (and single metallicity) models are fit to the simulated data and the 
best-fit temperatures are shown in the plots. The median values of 100 realizations are 
shown and the 68% errors are taken from the distributions. The values of parameters 
not shown are: hydrogen column density = 3 x 10 20 cm -2 , redshift z = 0.1, number 
of source photons = 10 , and the energy range used for fitting is (0.5-8.0) keV. For 

8 Note that the best-fit single temperature depends on other factors as well, especially on: (i) 
the metal abundance because the higher the abundance the stronger the low-energy emission 
line complex, (ii) the hydrogen column density because a higher column density has the same 
effect as a decreased effective area at low energies, and (iii) the background characteristics 
because, e.g., an instrument with higher particle background has a poorer signal- to- noise ratio 
at high energies, which is similar to a decreased effective area at high energies. 
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these illustrations, the source emission is assumed to dominate over the background at 
all energies, so no background is included in the simulations. 

The plots show how the best-fit single temperature decreases with increasing emis- 
sion measure ratio of cold and hot component (from 10% to equal emission measure). It 
also becomes clear how the temperature decreases with increasing metallicity because 
the number of low-energy photons constraining the fit increases in this case. Moreover, 
the plots clearly reflect the different sensitivities of the different instruments, e.g., the 
relatively hard Suzaku XIS-FI typically returns much higher temperatures than the 
relatively soft eROSITA. 
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Fig. 18 Best-fit single temperature of two-temperature plasma emission as a function of 
emission measure ratio of the two components, for different X-ray instruments: Chandra ACIS-I 
(upper left), XMM-Newton EPIC-pn (upper right), Suzaku XIS-FI (lower left), and eROSITA 
(lower right). See text for details. 



So, assuming increased multitemperature structure with increasing radius in cluster 
outskirts, e.g., due to infalling cold matter, eROSITA would see a steeper temperature 
gradient than Suzaku and, therefore, would give rise to a lower total mass estimate 
(Fig. [3]) if the different temperature components cannot be spectrally disentangled. 

This seems dramatic; however, we have picked an extreme case (k^T e ij 0t = 8 keV, 
&BT c ,cold ~ ^ keV) for illustration purposes. As long as at least one temperature 
component is cooler than about 1 keV, the reduced x 2 ls actually quite bad (>1.5) in 
most cases (for 10 4 source photons in the absence of any background); i.e., it is actually 
clear from the observed spectrum that a single temperature model is a bad fit. Only 
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if both temperatures are above about 2 keV provides the single temperature model an 
acceptable fit[f] In the presence of significant background, however, acceptable fits can 
be obtained also for cooler components. 

Fig. [T5] illustrates that also the metallicity determination is biased if the presence 
of a multitemperature plasma is ignored. For instance, in case both components have 
a metallicity of 0.3 solar (the red data points), the resulting best-fit single component 
metallicty could be higher for the part ic ular situation simulated here. For a more 
detailed discussion see, e.g., iBuotei |2000l ); lGastaldello et al.l l|2010l ). 
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Fig. 19 Metallicity bias when assuming a single-temperature and single-metallicity emission 
while the true emission actually has two components. Simulated Suzaku data have been used. 



Future instruments (e.g., the upcoming Astro-H and an envisaged Athena-\ike mis- 
sion) will have enough collecting area and spectral resolution to better distinguish 
multitemperature structures spectrally. This will be crucial for a better understanding 
of the thermodynamics and chemistry especially in cluster outskirts. 



4.7 Helium sedimentation 



While potentially important for cluster (gas) mass estimates, this topic is likely more 
relevant for inner cluster r egions; therefore, it is only briefly summar i zed here. More 
details can be found, e.g., in lEttori fc Fabianl (|200d ); lMarkevitchl (|2007r ); |Peng fc Nagail 
(|200gh ; iBulbul et al.l i^OHl ), and references therein. 



The mean particle weight, fi, enters the hydrostatic equation l[6)l: it is inversely 
proportional to the total mass, MtotEB Since the overdensity radius also depends on 
MtoX (eq. 13}, the effect on the mass of changing /i is amplified, resulting in a (~1.5 



9 Naively, one might draw an immediate conclusion from these very simple simulations: as- 
suming the reduced x 2 values of the published Suzaku temperature profiles in cluster outskirts 
are close to 1, then the observed excess surface brightness either would not be due to clumping 
or the clumps would not be cool (<2 keV, see also Sections 14. 1.11 and 14. 81 . However, this con- 
clusion would be flawed because repeating the above simulations with significant background, 
as is the case in cluster outskirts, the reduced x 2 values actually stay acceptable in most cases 
(even for k^T c co \^ < 2 keV). This illustrates that interpretations of observational findings in 
cluster outskirts always need to account for the background. 

10 Recall that for the functional form of eq. 10 the radial gradient of fi has been assumed to 
be small. 
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times) stronger dependence for masses determined within an overdensity radius, e.g. 
-^2500- On the other hand, to calculate the ICM mass, M gas , from the observed electron 
density distribution (eq. 0), the ratio £ s ™ needs to be known. So, the composition of 
the ICM matters for both the Mtot and the M gas determination. 

While, e.g., an iron atom is more than ten times heavier than a helium atom, 
helium is about 10,000 times more abundant in typical I CM conditions than iron . 
Moreover, diffusion ope rates faster for lighter elements (e.g.. lGilfanov fc Sunvae vlll984l ; 
IChuzhov fc Lo cb 2004) . Therefore, the strongest influence on u and f e °* should come 
from variations in the helium-to-hydrogen ratio, 

If a significant fraction of helium ions settled towards the cluster center over time, 
then 22^ would increase there. Hydrodynamic simulations indicate, an increase of 
the helium-to-hydro gen ratio of a factor of two might be realistic in some cases (e.g., 
IPeng fc Nasal 2009. with some dependence on the strength and structure of the cluster 
magnetic field and the temperature). Assuming — 0.083 and typical abundances 
of heavier elements, /x is approximately in the range 0.59-0.60, and in the range 

1.14-1.15. Doubling the helium-to-hydrogen ratio, one obtains about 0.67 and 1.25 for 
the same quantities, respectively. Neglecting helium sedimentation; i.e., assuming the 
wrong helium-to- hydrogen ratio (0.083), would in this case bias high Mtot by ~13% 
and Mgas low by ~8%, within a fixed radius. If these were the only effects, then the 
gas mass fraction, f s = M ga s/Mtot, would be biased low by ~19% compared to the 
true fraction. 

However, there is another effect because, actually, only the X-ray flux is directly 
observed, not the electron number density. Since the emissivity (eq. [7} depends on 
the relative number of ions present, the flux to n c conversion is affected by the as- 
sumed helium-to-hydrogen ratio. If the true helium-to-hydrogen ratio is higher by a 
factor of two than the assumed one then the electron density will be overestimated 
by about 5% when fitting a typical cluster spectrum (taking into account that the 
true electron-to-hydrogen ratio increases from about 1.17 to 1.33 when increasing the 
helium abundance) 

Therefore, considering all three effects (underestimated [i, underestimated , 
and overestimated n c ), /g might be very roughly underestimated by ~14% in some clus- 
ter centers, given these very simplified assumptions. Since the true helium-to-hydrogen 
ratio in cluster centers is unknown, f s might be best determined within large radii, ap- 
proaching cluster outskirts, where the helium-to-hydrogen ratio should not be strongly 
affected by sedimentation. Combining X-ray, SZ, and gravitational lensing observations 
could, in principle, enable a measurement of the helium sedimentation factor. In par- 
ticular, the SZ effect depends only on the electron density and not on the product of 
electron and ion density as the X-ray emission. 

11 Note that, especially for low temperature systems, the abundance of heavy elements would 
be underestimated during a spectral fit due to the overestimated electron density, possibly 
giving rise to artificial observed abundance drops (or less steep increases) towards cluster 
centers due to helium sedimentation. Furthermore, independent of any helium sedimentation, 
when using the XSPEC apec or mckal model normalization parameter to determine the electron 
density then the result depends on the chosen built-in abundance table; e.g., for a typical cluster 
spectrum, the flux in the (0.1—2.4) keV band decreases by ~7 % for a given normal i zation if the 
Lodders (2003) abundance table is used instead of the default Anders & Grcvcssc (1989). This 
is because the Lodders table assumes a helium-to-hydrogen ratio of 0.0792 while for Anders & 
Greve sse it is 0.0977. WM AP3 data indicate a primordial value around 1/(4/0.24819 - 4) « 
0.083 dSpergel et al J2007T) while the final nine-year WMAP data sugg est a higher value aro und 
0.11, although consistent with WMAP3 at the 95% confidence level llHinshaw et aI.ll2012D . 
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4.8 Gas clumping 

Gas clumping; i.e., inhomogeneities in the gas distribution (and possibly the tempera- 
ture distribution), must be present in the intracluster medium at some level. However, 
at which level, at what size scales, and how it varies with increasing distance from 
the cluster center is not very clear, yet. Observational and theoretical aspects of gas 
clumping have already been discussed in several previous Sections, including partic- 
ularly Section [4.1.11 This will not be repeated here. In short, if present, the effect of 
increasing clumping with radius would be to overestimate the gas mass, because the 
gas density in a given region would be overestimated, and to underestimate the total 
mass, because the gas density profile would appear to drop less steeply (the total mass 
is proportional to the gas density gradient, eq. |6j), and because the temperature profile 
may appear to drop more steeply if the clumps are cool (Fig. [3] shows that a larger 
gradient in temperature typically results in a lower total mass estimate). That is, each 
of the three effects would tend to result in an overestimated gas mass fraction. 

If significant clumps are present at large-enoug h scales, they mig ht be directly ob- 
served in deep X-ray surface brightness images fe.g-. IVazza et all2013T ). Alternatively, if 
clumps are cool, they may be revealed through X-ray spectroscopy. Furthermore, com- 
parison of X-ray and SZ-derived ICM profiles may put upper limits on the clumping 
factor (e.g., references in Section r4.2[) . Also, one may get constraints from considering 
whether the already existing slight tension between predicted and observed SZ angular 
power spectrum discussed in Section 14.31 might become more severe if significant gas 
clumping were present in cluster outskirts and resulted in an even more increased SZ 
signal. 

5 Chemistry 

5.1 Relevance of chemical evolution 

Among the several outstanding problems of modern astrophysics, the chemical enrich- 
ment of the Universe is certainly a very topical one. In the standard scenario, shortly 
after the Big Bang, the element composition of the cosmic gas was very simple - it was 
composed almost entirely of hydrogen and helium, the two lightest chemical elements. 
Heavier elements than helium are thought to have their origin in the interiors and explo- 
sions of stars, which formed much later in the history of the Universe. It is believed that 
the fi rst stars ( "Population H I " ) formed few hu ndred million years after the Big Bang 
fe.g.. lCouchman fe Reeslll986l : lAbel et alJl2002h from collapse of primordial, metal-free 
gas and they have subsequently polluted the surrounding medium by ejecting the met- 
als produced in their cores fe.g.. balli fc Pallall 19981 ; iBromm et al.lll999l ;l Yoshida et al.l 
2008; ICen fc Riquelme|[20oi ). The minimum amount of gas that may collapse under its 
own gravity is ap proximately given by the Jeans mass, which increases with tempera- 
ture as y 3 / 2 fe.g lBromm fc Loebl200a ). In a primor dial environment, the o nly relevant 
coolants are H, He and their derived molecules (e.g.. ISaslaw fc Zipovlll967r ). which will 
not cool below a relatively high temperature of (100-200) K. Thus, the Population III 
stars are supp osed to be more massive and short-lived fe.g.. IWooslev fc Weaverlll995l : 
ISchaerer1l2002h than the typical stars that form from metal-rich gas. Once the Popula- 
tion III stars had enriched the gas, cooling due to metal line emission became efficient 
and gas clouds could cool to lower temperatures and fragment into smaller pieces. The 
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presence of metals probably drove the transition from a top-heavy initial mass function 
(IMF) to a "Salpeter-like" IMF allowing low- mass stars to form. 



5.2 Metalhcity in cluster outskirts 



The ICM does not only contain primordial elements, but also a considerable amount 
of heavy elements. As metals are only produced in stars, which reside mainly in galax- 
ies, the enriched material must have been transported from the galaxies into the 
ICM. Several processes have been proposed to explain the observed enriched mate- 
rial: ram-pressure stripping, galactic wi nds, galaxy-galaxy interacti ons, AGN outflows, 
and intracluster stellar populations (see lSchindler fc Diaferioll2008| for a review). Each 



of them has a different efficiency and pattern distribution (e.g., Schindler et al.l l2005: 



iDomainko et alj|2006t IKapferer et aljliooil ; iMoll et al.ll2007 ). Simulations show an ex 



tended distribution for the metals due to galactic winds and a more centrally con- 
centrated distribution in case of ram-pressure stripping. Furthermore, winds seem to 
be more efficient in early epochs when the star formation rate was higher, while ram - 
pressure starts to play an important role below redshift 1-2 (e.g. . IKapferer et al.ll2007T ). 
In the past, the general idea was that in cluster outskirts winds are more important 
than ram-pressure stripping because, on the one hand, there is lower ICM pressure 
around the galaxies to confine the metals, and, on the other hand, the ram-pressure 
stripping increases when the ICM density is higher. There is cle ar evidence nowa- 
days that ram-pre s sure stripping is acting also in the outskirts (e.g., Crowl et al .11 20051 : 
iBoselli et al.1l200d : ICortese et al1l2007l : ISun et al.ll2007l : IVollmerll2009h . 

The only way to determine the chemical composition of the intracluster medium is 
through X-ray observations. Due to its large effective area, good spectral and spatial 
resolution, XMM-Newton is currently the best instrument to study the spatial distri- 
bution of metals in the ICM. Due to its high background level, it has only been used 
to determine the abundances in the inner and intermediate par ts of clusters, though, 
where the emission is brightest (e.g.. Eeccardi fc Mol endi 2008a). In cluster outskirts, 
the temperature determination, and so the metal measurements, are easily aff ected 
by the level of subtracted background (e.g., Ide Plaa et all I2OO6I ; ILeccardi fc Molendil 
2008bJ). Thus, a large fraction of the clusters' volume is still unexplored. 

Recently, X-ray observations with Suzaku, thanks to its much lower instrumen- 
tal background than that of other X-ray observatories, have pushed m easurements 
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(2011) showed that the outer regions of clusters are considerably metal-enriched, to a 
level of about 0.2-0.3 of the solar metalhcity, wh ich is the typical v alue observed in 
intermediate cluster regions. On the other hand. llJrban et all 1 20lih measured with 
high statistical significance (5.5cr) a lower metalhcity of 0.11±0.02 solar at the virial 
radius of the Virgo cluster using XMM-Newton data. 

The physics of the ICM outskirts must be understood to avoid biases in the chem- 
istry determination (e.g., Figs. 1161 right, and !19[) . For instance, gas clumping can lead 
to an overestimate of the gas density (S ection l4.8l) and an underestimate of the temper- 
ature (Section l4.6p . lUrban et al] 1 20 111 ) argue that unresolved multitemperature struc- 
ture in the low temperature outskirts of the Virgo cluster may lead to an underestimate 
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of the metallicity. Thus, the metal mass inferred from the metallicity measurement may 
be a lower limit. 

Another source of bias in the metal mass determination can be the inhomoge- 
neous metallicity distribution. Due to the fact that the enriched material is not mixed 
immediately with th e ICM, metal blobs and stripes can be observed. For instance, 
lLovisari et al.l ( 201 ll ) showed with XMM-Newton observations of five cool-core clus- 
ters that even in relaxed clusters the distribution of metals shows significant inhomo- 
geneities. Using metal maps they evaluate that most previous metal mass determina- 
tions have underestimated the metal mass by (10-30)% due to this inhomogeneous 
distribution. 

The spatial distribution of metals is linked to galactic evolution (e.g., supernova- 
driven galactic winds, outflows due to AGN, and galaxy-galaxy interactions), galaxy- 
ICM interactions (e.g., ram-pressure stripping), and gas dynamics (e.g., gas drifting, 
turbulence, convection, and mixing). Important advances in un derstanding the s e pro- 
cesse s have been made over the past few years. For instance, iRebusco et all (2005, 
2006) found that the optical light profiles are much more peaked than the abundance 
profiles, while in absence of mixing they should follow a similar distribution. This sug- 
gests that metals originating in the central regions are transported out to large radii 
by gas motions. Including AGN feedback in the simulations leads to iron profiles in 
better agreement with observations because th ey are able to move over a large scale 
the gas enriched within the galactic halos (e.g.. iFabian et al.ll201ll ). The role of AGN 
in the metal transport has been investigated also bv lChurazov et alJ (2001). They pro- 
posed a scenario where radio bubbles, produced by the jets of the AGN, uplift the cold 
en riched gas out to l arge d istances. This scenario has been confirmed observationally 
bv lSimionescu et al.l (|2008l ) analyzing XMM-Newton data of the cluster Hydra- A. 

The ICM is rarely in perfect hydrostatic equilibrium because large-scale accretion 
of matter, turbulent motions, gas sloshing, AGN, and galaxy motions are consider- 
ably and regularly perturbing the gas (e. g., Schuecker et al.l 2004 L Henry et al.l 2004; 



ably and regularly p erturbing the gas (e.g., bchuccke r et al.l 120041. lilenrv et al.l 120041 
IVogt fc Enfihr]|2005l : liirfflin fc Vogtll2006l : ISanders et al.ll20ld : iRoedieer et al.ll2012l l 
Since both large- and small-scale mixing properties of galaxy clusters are at work at 
the same time during the whole cluster evolution, it is currently difficult to obtain a 
complete and consistent picture of the evolving ICM. To shed light on the process of 
metal injection we need to resolve the metal distribution down to the small scale of 
mixing, and this requires the large effective area of an envisaged Athena-like mission. 



6 Technical issues 

6.1 X-rays 

6.1.1 How to deal with foreground and background emission in low surface brightness 
regions? 

The cluster surface brightness decreases fast with increasing radius, making measure- 
ments of gas properties more difficult in cluster outskirts. A gas density measurement 
can be accomplished with much poorer signal-to-noise data than a temperature mea- 
surement. For instance, one can pick an energy band where the surface brightness, 
Sx, is almost independent of temperature for hot clusters, e.g., (0.5-2.0) keV (Fig. [2]). 
Then determine the surface brightness, or rather the number of source photons in this 
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energy band in some annulus divided by exposure time and effective area. And finally, 
obtain the density by inverting 



Also note the strong redshift dependence due to cosmological surface brightness dim- 
ming. 

On the other hand, Figures l2l and [171 show that measuring the temperature of hot 
(fcsTe ^ 5 keV) clusters requires high quality spectral data at high energies (E > 5 
keV) because of the absence of the Fe L shell line complex around 1 keV and because 
the exponential bremsstrahlung cutoff shifts to high energies. High quality data means 
good source photon statistics as well as low and well-determined fore- and background 
surface brightness. We will see that, at high energies, the so-called particle background 
dominates if times of so-called soft proton flares have been accurately excluded. 

X-ray optics work only for small photon incidence angles (grazing incidence). The 
higher the photon energy, the smaller the critical angle for reflection. For a given focal 
length, this results in decreasing effective area with increasing energy as high energy 
photons cannot be focused by outer mirror shells anymore. In summary, for a hypo- 
thetical intrinsically flat spectrum X-ray source, the detected signal decreases rapidly 
for higher photon energies. This is also true for astrophysical foreground/background 
components but not for other types of background, as dicussed below. 

The particle background (events not caused by astrophysical X-ray sources but 
indistinguishable from them) spectrum registered on the detector is typically fairly 
flat, so will always become dominant above some energy. This threshold energy de- 
pends strongly on the instrument setup, including satellite orbit, focal length, presence 
of anti-coincidence devices etc. The ambient particle background level is lower in a 
low-Earth orbit (LEO, e.g., ROSAT, ASCA, Suzaku, Swift, Astro-H) compared to a 
highly-elliptic orbit (HEO, e.g., Chandra, XMM-Newton) or Lagrange point 2 (L2, e.g., 
eROSITA) orbit because satellites are naturally shielded by Earth's magnetosphere. 
Particle background also roughly scales with focal length squared and the number 
of mirror/detector systems (e.g., Chandra has a focal length of 10m and one mirror 
system while eROSITA has seven mirror systems and a focal length of only 1.6m). Anti- 
coincidence devices (e.g., ROSAT PSPC) can currently not be applied very efficiently 
to X-ray CCDs. Fluorescent lines due to detector material (e.g., Al, Co) induced by 
particle background can be suppressed by applying graded-Z shields (e.g., eROSITA). 
In general, if the particle background is underestimated during analysis then the result- 
ing temperature profile will often appear to decline less steeply with radius in cluster 
outskirts (e.g., Fig. 1 in lde Plaa et al.ll2006l ) because background becomes increasingly 
important at larger radii and particle background dominates at high energies, resulting 
in an apparent shift of the ICM bremsstrahlungs cutoff towards higher energies; this 
would then give rise to an overestimated total mass (Fig. 

Satellites in orbit mostly outside Earth's magnetosphere (e.g., Chandra and XMM- 
Newton), are occasionally (about 30% of the time) bombarded with so-called soft pro- 
tons. They can hit the focal plane instruments when entering through the mirrors, 
increasing the background by a factor of up to ~100 or more. For analysis of extended 
sources, times during such flares need to be removed as far as possible from the data. 
This works reasonably well using lightcurves since X-ray CCDs have good time resolu- 
tion (analysis is being done on a data-hypercube - each individual detected event gets 
a position, an energy, and an arrival time assigned). However, low level flares can go 




(19) 
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undetected in lightcurves and cause a background signal that needs to be taken care 
of in subsequent spectral analyses. Satellites in LEO, like Suzaku, do not suffer from 
such flares. 




Fig. 20 The Milky Way seen at a wavelength of 21 cm . Provided by P. Kal berla and repro- 
duced with permission (Leiden/Argentine/Bonn Survey, iKalberla etalll2005n . 




Fig. 21 The Milky Way as seen in soft X-rays (~l/4 keV). Notice the anti-correlation with 
the Galactic neutral hydrogen map above. Data from the ROSAT All-Sky Survey (see, e.g., 
Snowdcn ct al. 2008). The yellow circles indicate the same X-ray bright galaxy clusters as in 
Fig. OH 



The dominant soft (E < 1 keV) X-ray background is due to thermal emission of 
optically thin gas (bremsstrahlung and line emission) and charge exchange emission 
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(line emission). The former varies with pointing direction and is particularly strong 
at low Galactic latitudes and the latter varie s with time. The primary components 
are assumed to be (e.g.. ISnowden et alj|200ih : Possibly, emission from the local hot 
bubble - hot gas left over from one or more long gone supernovae, which exploded 
in the vicinity of the Sun (k^Tx ~ 0.1 keV). Galactic emission from the disk and 
Galactic halo (fcgTx ~ (0.1-0.2) keV). Possibly, local group emission (k^Tx ~ 0.2 
keV). Emission from large-scale Galactic objects like supernova remnants and from a 
large number of unresolved stars at low Galactic latitudes. And, last not least, time 
variable solar wind charge exchange emission from the geocorona and/or heliosphere, 
which may be responsible for a significant fraction of th e emission previously attributed 
to the local hot bubble (e.g.. iKoutroumpa et al.ll201lh . 

Furthermore, there is a cooler interstellar medium (ISM) in our Galaxy, as traced, 
e.g., by neutral hydrogen. Since the ISM has been enriched by metals, when there is 
hydrogen then there are also heavy elements, causing X-ray absorption. So, hydrogen 
maps can be used to estimate Galactic X-ray absorption for distant sources. The anti- 
correlation seen in Figs.[20]and[2l]indicates that a significant fraction of the soft X-rays 
does not originate in the local hot bubble or from solar wind charge exchange but comes 
from farther away (e.g., IPietz et al.lll99Sl ). 

Last not least, at intermediate energies, above E > 1 keV and below the particle 
background threshold energy, the sum of emission from distant undetected AGN dom- 
inates. This component can be reduced if the instrument has good spatial resolution 
(e.g., Chandra), which allows to detect point sources down to very low fluxes. 

All of these fore- and background components have to be subtracted or modelled 
with incr easing accuracy the fu rther out one goes into the cluster outskirts. E.g., Fig. 2 
(right) of lReiprich et al.l (|2009h shows that in an annulus approaching r2oo of the cluster 
A2204, the cluster emission is subdominant at almost every energy and different fore- 
and background components dominate different parts of the spectrum. Methods need to 
be optimized for each instrument and observational situation to obtain robust results, 
particularly on temperature measurements. The characterization of the X-ray emission 
from outskirts of clusters at low Galactic latitudes is especially challenging. A useful 
approach is also to combine Chandra and XMM-Newton observations for point source 
detection with Suzaku observations for the faint cluster emission. However, since AGN 
are variable sources, such observations would ideally be performed simultaneously. 
Moreover, a future X-ray mission could be optimized for cluster outskirts studies by 
combining good spatial and spectral resolution over a large field-of-view with large 
effective area and short focal length in a low-Earth orbit. 

6.1.2 How to deal with the Suzaku point-spread-function and stray light? 

Currently, Suzaku is the best X-ray instrument to determine the temperature and 
metal abundance of the ICM in cluster outskirts. In addition to the general fore- and 
background issues discussed above, the particular challenges for the Suzaku analysis 
include PSF and stray light correction. While Suzaku's PSF does not strongly depend 
on photon energy - a significant advantage over ASCA - it is broad, about 2 arcmin 
half-power-diameter, resulting in contamination of the observed spectrum in a given 
annulus from neighboring shells, e.g., from a bright cluster core. This c an be corrected 
for by ray tracing simulations (e.g., using xissim, llshisaki et alJl2007h given a finely 
binned cluster surface brightness and temperature profile. I.e., if one knows exactly 
how the cluster looks like, one can accurately correct for PSF effects. So, one has to 
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stick in what one actually wants to measure. Therefore, an iterative approach needs 
to be taken and, again, additional Chandra or XMM-Newton data to model the inner 
regions are very helpful. Furthermore, the PSF limits the determination of reasonably 
resolved temperature profiles to low redshift clusters (z < 0.3). 

Also stray light can be an issue; i.e., single reflections of photons from sources 
outside the field-of-view that reach the detector. This is particularly true for very low 
redshift clusters that require multiple Suzaku pointings to cover a significant radial 
range in terms of r200j but any observation can be contaminated if there is a bright 
source nearby. This can, in principle, be corrected for with ray tracing simulations but 
this approach is limited by the accuracy of the underlying calibration. Moreover, the 
observing strategy can be adjusted to minimize stray light effects. Off-axis observations 
of the Crab nebul a have shown this to be the case if the CCD corners point towards 
the bright source (|Takei et alJl2012h . 



6.2 SZ 



6.2.1 Direct measurement of the cluster pressure profile 

As seen in eq. Q the SZ signal is proportional to the product of electron density and 
temperature integrated along the line-of-sight; therefore, deprojection of a resolved 
SZ profile will directly yield the three-dimensional pressure profile for a cluster. In 
practice, such direct inversion of an SZ image is difficult to perform due to the poor 
resolution of the current SZ experiments, and also due to the difficulty in obtaining 
filtering-corrected SZ profiles out to large radius without assuming some kind of model. 
A majority of studies to date have, therefore, made use of simultaneous fitting of SZ and 
X-ray dat a using some parametric model for the gas den s ity and temperature dis tribu- 
tion (e.g. jMahdavi et alJliob^jMroczkowski et al"ll2009l . iBonamente et alj|2012f ). The 
generalized NFW model is currently considered to be the best description of the shape 
of the intrac luster gas pressure, and is constrained with high precision from X-ray data 
insid e r.^nn jArnaud et a~ 201fj). Som e early attempts at non-parametric comparison 



(e.g., |jia et al' I I2008I . iBasu et aljboich also found generally good agreement between 



the X-ray prediction of pressure and the SZ measurements. It must be mentioned that 
data from single clusters have too little constraining power to find systematic differ- 
ences between X-ray and SZ measurements at large radius, and a more meaningful 
comparison can be obtained after stacking the SZ signal from several clusters. Early 
stacking results do not show any statistically signifi cant deviation (Section 14.1.31 and 
IPlagge et ail bold . [Planck Collaboration et al.l 120131 ), although such comparisons are 
still in their early stages. 



6.2.2 Density and temperature profiles from joint X-ray /SZ modelling 

A more direct analysis of the state of the intracluster medium at large radius is possible 
if X-ray and SZ observations are used together to model the density and temperature 
profiles simultaneously. Since these two observables have a different dependence on 
n c and T e , it is in principle straightforward to solve for them using X-ra y and SZ 
data. Such joint modelling was originally proposed by ISilk fc W hite! J l978h and has 
been applied to simulated data and analytical mo dels (e.g.. lYoshikawa fc Sutolll999l : 
IPuchwein fc Bartelm ann 200r3; lAmeglio et alj|2009h but application to real cluster data 
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Fig. 22 Temperature profile for the cluster Abell 220 4 obtained t hroug h a joint X-ray/SZ 
analysis (black squares). Reprinted with permission from lBasu et al.l 1120101 ). X-ray information 
includes only the surface brightness, no spec tra. The red diamo nds result from the spectral 
XMM-Newton analysis of the same cluster bv lZhang et al.l fl2008T ). 



have been very few jKitavama et al.l |2004 iNord et alj 120091 ; iBasu et alj|20ich . The 



major difficulty lies in obtaining high-quality SZ data with sufficient resolution, and 
to compensate SZ flux loss due to filtering. Once an SZ profile is obtained, one can 
simply use the X-ray surface brightness from the soft energy band (roughly 0.5-2 
keV) to model first the gas density, where the X-ray cooling function is practically 
insensitive to gas temperature in a hot cluster (Fig. [2|, and then use it to obtain the 
gas temperatures from the SZ data. The potential advantage lies in the fact that the 
X-ray surface brightness profile can be obtained reliably with a smaller number of 
photons than required for an accurate spectral measurement of the gas temperature. 
This technique, therefore, opens a new window to the cluster outskirts by measuring 
the temperature (and mass) of the ICM independently of X-ray spectroscopy (e.g., 
Fig. [22]). 

One difficulty with the above method is that any biases incurred in the X-ray 
brightness profile measurement will affect the SZ-derived temperature measurement 
with opposite bias. The SZ data alone cannot constrain the gas temperature, as it 
measures only the integrated pressure at different scales. The density normalization 
must come from X-ray brightness profile measurements. Therefore, if the X-ray emis- 
sion is boosted, e.g., due to the presence of gas clumping, the temperature profile 
inferred from X-ray/SZ joint analysis will show a steeper decline. This will complicate 
the comparison between SZ-derived temperature and X-ray spectroscopic tempera- 
ture, particularly in the cluster outskirts where clumping effects may be significant 
(Section gXTJ. 

6.2.3 How to measure profiles with poor spatial resolution? 

As already mentioned, current comparisons between the X-ray and SZ measurements 
are mainly limited by the low angular resolutions of the SZ experiments. SZ survey 
telescopes generally operate with a beam size of ~1 arcmin, and the Planck surveyor has 
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an effective beam size of 7 arcmin for SZ science. Massive clusters at very low redshifts 
[z <• 0.1) are not suitable for imaging with ground based SZ instruments due to their 
large angular dimensions, as the limited field-of-view of bolometer cameras make the 
removal of atmospheric noise difficult. Interferometers act like a high-pass filter of 
the sky signal and are generally unsuitable for imaging large objects. In addition, 
single-frequency SZ experiments are affected by the confusion from primordial CMB 
fluctuations in their reconstruction of the SZ signal at large radii. 

On the other hand, most X-ray spectroscopic studies of cluster outskirts have been 
done with very low redshift targets (Section I4.1.2p . either to have sufficient signal-to- 
noise over X-ray fore- and backgrounds (Section I6.1.1|l , or to avoid the complicated 
PSF smearing effect of the Suzaku satellite (Section I6.1.2|) . Some of the low redshift 
cluster profiles have now been measured by the Planck satellite with sufficient accu- 
racy (Section I4.1.3|) and can be compared or combined with low-redshift X-ray data 
(Section 14.1.40 . But the low resolution Planck results need to be tested against finer 
resolution ground-based instruments for possible systematic biases in their beam de- 
convolution or multi-frequency matched filtering techniques. Future high resolution SZ 
instruments like CARMA and the Cerro Chajnantor Atacama Telescope (CCAT) will 
provide sufficient angular resolution for intermediate redshift clusters, and CCAT will 
also have a large field-of-view to image some of these z < 0.1 objects. 

The other aspect of reliable SZ imaging at large angular scales is the development 
of effective deconvolution methods, to account for the loss of signal due to the multiple 
high-pass filters employed on the data to remove atmospheric noise and the CMB con- 
tributions. While fitting a model to the data such filtering renders the model sensitive 
to only a very limited spatial range of the cluster SZ signal, typically near its central 
region. Thus computing the integrated SZ signal or measuring the pressure gradient 
in the outskirts is done from model extrapolation into the domain where the actual 
measurements have no impact. We remind that the situation u sed to be somewhat 
similar in X-ray-based analyses of cluster pressure profiles (e.g., lArnaud et al.l EiOlO'l 
where the pressure slope beyond rsoo was fixed by simulations. With Suzaku, this is 
now changing. Model independent (non-parametric) reconstruction of the full SZ signal 
has so far been atte mpted by two different methods: by making a direct deconvolution 
of the filtered map ( Nord et al.ll2009l : ISavers et al 1 l201lh and by an an alytic inversion 
of the radial profile assuming spherical symmetry |Plagge et alj l2010h . These studies 
are currently limited by the SZ instrument sensitivities and low number statistics, al- 
though rapid progress is being made and future SZ/X-ray comparisons of the cluster 
outskirts hold much potential. 



6.3 Weak lensing 

6.3.1 How to deal with projection effects? 

Wide-field imaging cameras allow us, in principle, to trace the weak lensing signal of 
a typical cluster out to several virial radii. However, the weak lensing signal of any 
individual cluster blends into the cosmic shear background arising from the combined 
shear of the intervening large-scale structure at much smaller cluster centric separatio ns 
than technically accessible. From the early simulation-based work bv lHoekstral ( 2003T I . a 



rule-of-thumb could be drawn that the effect of projected large-scale structure becomes 



a considerable source of error for separations > 10 arcmin from the cluster center, even 
for massive clusters at redshifts yielding a high lensing efficiency. 

Recently, due to the improved capabilities for ray tracing th rough cluster iV-body 
simulations, this issue has found more wide spread attention fe.g-. lHoekstra et al.ll201ll : 
Becker fc Kravtsovll201l|; Gruen et al.||20 1lh. Based on ray trac i ng results from the Mil - 



Hilb ert et ail (20091 ). iHoekstra et~al] (|201ll ) 
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ll. 120031 ) that accounting for large-scale 



lennium simulation ( Springel et alj 1200- 
confirmed their earlier results ( Hoekstr 
structure lensing is cruci al when study i ng th e mass profile in clus t er out skirts. Start- 
ing with a simulation bv lTinker et al.l (|2008l ). feecker fc Kravtsovl l|201ll ) investigated 
the biases and uncertainties arising from large-scale structure uncorrelated to the clus- 
ter (and also from cluster triaxiality) and find large-scale structure to add significant 
scatter to weak lensing mass estimates obtained from profile fitting. They arrive at 
an additiona l unce rtaint y induced by large-sca le structure consisten t with the findings 
of IHoekstral (|2003l ) and Irloekstra et alj j201 ll ). iGruen et all |201ll ). who draw atten- 
tion to the influence of structure correlated to the cluster of interest, also corroborate 
this picture: shear profiles are increasingly dominated by large-scale structure scatter 

beyond a separation of ~10 arcmin. 

Again using ray tracing simulations ( Sato et all 120091 ) , lOguri fc Hamarial ( 201 ll ) 
considered the outer cluster profile itself, which they find be st described by a modi- 
fied NFW profile with a smoothed cu t -off (Baltz et alj 12009). motivated by the halo 
model. Furthermore, if a iBaltz et alj (200E) profile were fitted with an NFW func- 
tion to an outer boundary of (~10— 30) arcmin, the consequence would be a (5-10)% 
underestimate of the cluster mass, while its concentration would be overestimated. 

How can one incorporate these simulation results in practice? For cluster masses 
obtained by fitting a parametric function (e.g., NFW) to the shear profile, shear by 
large-scale structure can be taken in to account as an additional component in the error 
estimation. Ilsrael et al.l ( 20ld . l2012l ) provide a practical example for this met hod, using 
an est imation of large-scale structure contribution from the simulations of IHoekstral 
l|2003h . 

Weak lensing tomograph y aims at direct dis e ntanglement of lensing structures at 
different lens redshifts (e.g.. iMassev et all 120071 : ISchrabback et "al]|201ol . for applica- 
tions). So, in principle, contributions from uncorrelated projected large scale-scale 
structure could be measured and individual cluster mass profiles corrected for them. 
Accurate photometric redshifts requiring deep imaging in several bands is necessary 
to achieve resolution in the redshift dimension (although a poorer one than in the two 
"celestial" dimensions). 

Due to the inevitable lensing by projected, unrelated structures, an accurate weak 
lensing mass measurement is currently limited to massive (M> 10 14 M o ) individual 
clusters, though. Stacking techniques, however, make a wider range of the cluster mass 
function accessible to weak lensing. More importantly, cluster stacking averages out the 
effects of substructur e and departure from spherical symmetry, thus defining a "typical " 
(outer) mass profile (|Mandelbaum et al.ll200r3 ; ISheldon et~ai]|2009l ; lOkabe et aljfeoioh . 
Now that an axisymmetric mass distribution can be assumed, the quantity 



AE{6) = r cl . it7t (0) = E(<0) - E{6) , (20) 

with E(< 6) the mean projected density within the radius 8, provides an estimate of 
the projected surface mass density E(9). However, when stacking clusters, special care 
has to be taken to the determination of the cluster center. 
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Combining weak lensing shear a nd magnification measureme nts for some of the 
most massive clusters in the Universe, lUmetsu et all [|2011bll2011ah compute mass pro- 
files, and average them in a stacked profile. Using their Bayesian method, which breaks 
the mass-sheet degeneracy by including lensing magnification data, these authors find 
the logarithmic slope of the mass profile to steepen towards the center, over a large 
radial range, consistent with the NFW profile. 



7 Outlook 



As illustrated in this review, new instruments and improved theoretical modelling have 
resulted in significant progress in understanding the physics and chemistry of the in- 
tracluster medium as well as cluster mass profiles beyond r^oo- Also, new questions 
emerged. With upcoming SZ (e.g., Atacama Large Millimeter/submillimeter Array, 
ALMA; CCAT) and X-ray instruments (e.g., Astro-H; eROSITA), it will likely be 
possible to clearly disentangle, which of the suggested physical and enrichment pro- 
cesses dominate in cluster outskirts, which will in turn inform modelling of cluster and 
galaxy evolution. Still, we are far from reaching the really interesting regime, the region 
beyond the cluster virial radius (>rioo)> diving into the expected filamentary struc- 
ture. A stacking analysis of eROSITA data at cluster positions in mass and redshift 
bins seems promising. Still, for the gaseous component, new optimized instruments 
will likely need to be constructed to study individual systems. In the X-ray regime, 
breakthroughs could be achieved with a large effective area, Athena-like, mission. For 
a more accurate weak lensing determination of cluster mass profiles out to the filamen- 
tary regime, there is hope that with ongoing and upcoming deep, large-area imaging 
surveys (e.g., Kilo Degree Survey, KiDS; Dark Energy Survey, DES; Euclid; Large Syn- 
optic Survey Telescope, LSST), detailed stacking analyses can be performed. This will 
constrain structure formation models and possibly also the nature of dark matter. Let's 
move on to an exciting future. 
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